2021
DOI: 10.1007/s10140-021-01954-x
|View full text |Cite
|
Sign up to set email alerts
|

Limited generalizability of deep learning algorithm for pediatric pneumonia classification on external data

Abstract: Purpose (1) Develop a deep learning system (DLS) to identify pneumonia in pediatric chest radiographs, and (2) evaluate its generalizability by comparing its performance on internal versus external test datasets. Methods Radiographs of patients between 1 and 5 years old from the Guangzhou Women and Children’s Medical Center (Guangzhou dataset) and NIH ChestXray14 dataset were included. We utilized 5232 radiographs from the Guangzhou dataset to train a ResNet-50 deep con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…One of the limiting factors in the majority of published studies is the lack of testing on external datasets with the validation data being used instead for assessing the performance of the method [ 273 ]. This is a general issue in the field of AI and deep learning that could lead to “data leakage”.…”
Section: Discussionmentioning
confidence: 99%
“…One of the limiting factors in the majority of published studies is the lack of testing on external datasets with the validation data being used instead for assessing the performance of the method [ 273 ]. This is a general issue in the field of AI and deep learning that could lead to “data leakage”.…”
Section: Discussionmentioning
confidence: 99%
“…The challenges of obtaining and working with high-quality, external imaging data often stunt models from gaining the necessary validation required for eventual clinical deployment. Furthermore, even when external data are available, performance degradation is routinely observed when models are tested against external datasets [ 6 , 7 , 8 , 9 ]. Point-of-care ultrasound data presents additional unique challenges including various manufacturers, scanning presets, and probe types used based on individual institutional practices.…”
Section: Introductionmentioning
confidence: 99%
“…The significance of model initialization is amplified when we consider challenges in model generalization which are primarily due to feature distribution shifts between training datasets and real-world use. For example, a model trained and tested on adult CXR data from the same source (internal testing) may result in significantly higher performance compared to testing it on adult CXR data from another source (external testing) [11]. Additional performance degradation may be observed when pediatric images exhibiting the same disease(s) are included in the testing.…”
Section: Introductionmentioning
confidence: 99%
“…Model generalizability is defined as the ability of a trained model to capture generalized patterns and perform well on unseen data [18]. Medical computer vision relies on model generalizability for several reasons [11] including accommodating patient diversity, adapting to various data sources and quality, addressing ethical considerations, and enhancing clinical utility. A general model is robust to different data sources and population distributions, considering factors such as the patient/study subject's ethnicity, sex, and severity of the disease(s) expressed on the image.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation