“…To assess whether models can generalize across clinical distributions, we chose a wide variety of downstream CXR datasets that we used to finetune and validate our models. These datasets came from diverse sites, including Brazil 23 , China 24,25 , United States ,25,,26,27,28,29,30,31 , Spain 32 , Japan 33 , Vietnam 34 and other countries in Eastern Europe and Central Asia. 35 They also addressed a large set of tasks, such as lung nodule detection, line and tube placement, edema severity classification, pediatric pneumonia detection, pneumothorax detection and multi-class differential diagnosis; some datasets address wide-ranging detection tasks for multiple pathologies.…”