2021
DOI: 10.1186/s12886-021-01992-6
|View full text |Cite
|
Sign up to set email alerts
|

Generalisability through local validation: overcoming barriers due to data disparity in healthcare

Abstract: Cho et al. report deep learning model accuracy for tilted myopic disc detection in a South Korean population. Here we explore the importance of generalisability of machine learning (ML) in healthcare, and we emphasise that recurrent underrepresentation of data-poor regions may inadvertently perpetuate global health inequity.Creating meaningful ML systems is contingent on understanding how, when, and why different ML models work in different settings. While we echo the need for the diversification of ML dataset… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
19
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 19 publications
(19 citation statements)
references
References 17 publications
0
19
0
Order By: Relevance
“…emphasizes the need for ML models, especially those trained on data from a single source, to be validated on local, representative datasets. 33 This will pave the way for generalizability in ML applications for healthcare, while ensuring that the Global South is equitably represented in these solutions.…”
Section: Discussionmentioning
confidence: 99%
“…emphasizes the need for ML models, especially those trained on data from a single source, to be validated on local, representative datasets. 33 This will pave the way for generalizability in ML applications for healthcare, while ensuring that the Global South is equitably represented in these solutions.…”
Section: Discussionmentioning
confidence: 99%
“…Similar data sources combined across different studies create meta-studies to increase statistical power to detect differences and improve the generalizability of results [ 14 , 15 ];…”
Section: Why Is Sensor Data Integration Important?mentioning
confidence: 99%
“…Although data from all world countries are a distant goal, equivalent representation of all continents, ethnicities and the maximum number of countries is desired to reduce ML bias. Demographic information and other social determinants of health are typically not contained in these datasets, making it challenging to interrogate algorithms for bias 7 8. High-quality data are also fundamental for environmental-specific algorithm validation, which is essential before AI implementation.…”
mentioning
confidence: 99%
“…Available automated DR algorithm performance varies considerably in performance in the real world due to limited training data, including heterogeneity in disease presentations and suboptimal image quality 9. In addition, diverse sociodemographic and ethnic representation are necessary if generalisability is a goal 8…”
mentioning
confidence: 99%
See 1 more Smart Citation