2022
DOI: 10.1038/s41597-022-01608-8
|View full text |Cite
|
Sign up to set email alerts
|

BRAX, Brazilian labeled chest x-ray dataset

Abstract: Chest radiographs allow for the meticulous examination of a patient’s chest but demands specialized training for proper interpretation. Automated analysis of medical imaging has become increasingly accessible with the advent of machine learning (ML) algorithms. Large labeled datasets are key elements for training and validation of these ML solutions. In this paper we describe the Brazilian labeled chest x-ray dataset, BRAX: an automatically labeled dataset designed to assist researchers in the validation of ML… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(4 citation statements)
references
References 21 publications
1
3
0
Order By: Relevance
“…Our label extraction algorithm was successful in identifying corresponding labels in DS 1 across all classes. The results are in line with other methods proposed in the literature 12 16 18 . Missed classifications were mostly due to missing phrases.…”
Section: Discussionsupporting
confidence: 91%
See 1 more Smart Citation
“…Our label extraction algorithm was successful in identifying corresponding labels in DS 1 across all classes. The results are in line with other methods proposed in the literature 12 16 18 . Missed classifications were mostly due to missing phrases.…”
Section: Discussionsupporting
confidence: 91%
“…To minimize development time, the CheXpert labeler was used to annotate the MIMIC-CXR data set as well. Moreover, this labeler has been adapted and ported to process reports in other languages, such as Brazilian [18] and Vietnamese [19]. The process of labeling consists of three stages: In the first stage, mention extraction, the labeler scans the report for phrases typical for a class as defined in class-specific lists.…”
Section: Introductionmentioning
confidence: 99%
“…To assess whether models can generalize across clinical distributions, we chose a wide variety of downstream CXR datasets that we used to finetune and validate our models. These datasets came from diverse sites, including Brazil 23 , China 24,25 , United States ,25,,26,27,28,29,30,31 , Spain 32 , Japan 33 , Vietnam 34 and other countries in Eastern Europe and Central Asia. 35 They also addressed a large set of tasks, such as lung nodule detection, line and tube placement, edema severity classification, pediatric pneumonia detection, pneumothorax detection and multi-class differential diagnosis; some datasets address wide-ranging detection tasks for multiple pathologies.…”
Section: Resultsmentioning
confidence: 99%
“…Brax: Reis ( 2022 ) introduced the dataset which includes 40,967 CXRs, 24,959 imaging studies for 19,351 subjects, collected from the Hospital Israelita Albert Einstein, Brazil. The dataset is labeled for 14 radiological findings using report parsing (NLP).…”
Section: Datasetsmentioning
confidence: 99%