2019
DOI: 10.1609/aaai.v33i01.3301590
|View full text |Cite
|
Sign up to set email alerts
|

CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison

Abstract: Large, labeled datasets have driven deep learning methods to achieve expert-level performance on a variety of medical imaging tasks. We present CheXpert, a large dataset that contains 224,316 chest radiographs of 65,240 patients. We design a labeler to automatically detect the presence of 14 observations in radiology reports, capturing uncertainties inherent in radiograph interpretation. We investigate different approaches to using the uncertainty labels for training convolutional neural networks that output t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

9
1,638
3
11

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 1,791 publications
(1,661 citation statements)
references
References 6 publications
9
1,638
3
11
Order By: Relevance
“…We note, however, that pneumonia is a diagnosis that is made in the context of clinical evidence of disease, and a disease where there is not necessarily perfect concordance between severity of symptoms and radiographic evidence of infiltrate [3,20,21,36]. In the description of the creation of the "Pneumonia" label in the CheXpert dataset, the authors note that while pneumonia is a clinical diagnosis, "Pneumonia... was included as a label in order to represent the images that suggested primary infection as the diagnosis, " suggesting that clinical information may play a role in labeling [13]. Disentangling the relationship between radiographic evidence of consolidation, the clinical presence of pneumonia symptoms, and the influence of the latter on the labeling of the former in these datasets could be helpful.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We note, however, that pneumonia is a diagnosis that is made in the context of clinical evidence of disease, and a disease where there is not necessarily perfect concordance between severity of symptoms and radiographic evidence of infiltrate [3,20,21,36]. In the description of the creation of the "Pneumonia" label in the CheXpert dataset, the authors note that while pneumonia is a clinical diagnosis, "Pneumonia... was included as a label in order to represent the images that suggested primary infection as the diagnosis, " suggesting that clinical information may play a role in labeling [13]. Disentangling the relationship between radiographic evidence of consolidation, the clinical presence of pneumonia symptoms, and the influence of the latter on the labeling of the former in these datasets could be helpful.…”
Section: Discussionmentioning
confidence: 99%
“…To assess the robustness of models to dataset shift, we used chest radiographs from two large publicly-available datasets. For our model training source domain, we used the CheXpert dataset from Stanford [13]. This dataset contains 224,316 chest radiographs of 65,240 patients.…”
Section: Datamentioning
confidence: 99%
“…The second dataset used in this study is the CheXpert dataset, which has been released by Irvin et al in January 2019 and contains 224,316 chest radiographs of 65,240 patients [39]. This dataset was used to train a second GAN in order to demonstrate the feasibility of the proposed data sharing approach (see Figure 1).…”
Section: Dataset and Preprocessingmentioning
confidence: 99%
“…2. According to the labeling performance comparison mentioned in [39], most of the uncertainty labels (U) were assigned to 1.0, except for the consolidation class.…”
Section: Generating Labelsmentioning
confidence: 99%
See 1 more Smart Citation