Summary

Background
As deep learning becomes increasingly accessible for automated detection of diabetic retinopathy (DR), questions persist regarding its performance equity across diverse identity groups. We aimed to explore the fairness of current deep learning models and to develop a more equitable model that minimizes disparities in performance across groups.

Methods
This study used one proprietary and two publicly available datasets, containing two-dimensional (2D) wide-angle color fundus images, scanning laser ophthalmoscopy (SLO) fundus images, and three-dimensional (3D) optical coherence tomography (OCT) B-scans, to assess deep learning models for DR detection. We developed a fair adaptive scaling (FAS) module that dynamically adjusts the significance of samples during model training, aiming to lessen performance disparities across identity groups. FAS was incorporated into both 2D and 3D deep learning models for the binary classification of DR versus non-DR cases. The area under the receiver operating characteristic curve (AUC) was adopted to measure model performance. Additionally, we devised an equity-scaled AUC (ES-AUC) metric that evaluates model fairness by balancing overall AUC against disparities among groups.

Findings
Using in-house color fundus images on the racial attribute, the overall AUC and ES-AUC of EfficientNet after integrating FAS improved from 0.88 and 0.83 to 0.90 and 0.84 (p < 0.05), where the AUCs for Asians and Whites improved by 0.04 and 0.03, respectively (p < 0.01). On gender, the overall AUC and ES-AUC of EfficientNet after integrating FAS both improved by 0.01 (p < 0.05). Using in-house SLO fundus images on race, the overall AUC and ES-AUC of EfficientNet after integrating FAS improved from 0.80 to 0.83 (p < 0.01), where the AUCs for Asians, Blacks, and Whites improved by 0.02, 0.01, and 0.04, respectively (p < 0.05).
On gender, FAS improved EfficientNet’s overall AUC and ES-AUC by 0.02 each, with the same 0.02 improvement (p < 0.01) for females and males. Using the 3D deep learning model DenseNet121 on in-house OCT B-scans on race, FAS improved the overall AUC and ES-AUC from 0.875 and 0.81 to 0.884 and 0.82, respectively, where the AUCs for Asians and Blacks improved by 0.03 and 0.02 (p < 0.01). On gender, FAS improved the overall AUC and ES-AUC of DenseNet121 by 0.04 and 0.03, and the AUCs for females and males improved by 0.05 and 0.04 (p < 0.01), respectively.

Interpretation
Existing deep learning models exhibit variable performance across diverse identity groups in DR detection. FAS proves beneficial in enhancing model equity and boosting DR detection accuracy, particularly for underrepresented groups.
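The summary describes FAS only as a module that "dynamically adjusts the significance of samples during model training"; the exact update rule is not given here. Purely as an illustration of that idea, the sketch below (an assumption, not the paper's implementation) maintains a running loss per identity group and upweights samples from groups whose loss is above the average, so harder, often underrepresented, groups contribute more to the training objective. The class name `AdaptiveGroupScaler` and its interface are hypothetical.

```python
class AdaptiveGroupScaler:
    """Hedged sketch of adaptive per-group loss scaling in the spirit of FAS.

    Not the paper's algorithm: this simply tracks an exponential moving
    average (EMA) of each group's loss and weights samples by their group's
    relative difficulty.
    """

    def __init__(self, groups, momentum=0.9):
        self.momentum = momentum
        # Running (EMA) loss per identity group, initialized uniformly.
        self.avg_loss = {g: 1.0 for g in groups}

    def update(self, group, loss):
        """Fold a new per-sample loss into the group's running average."""
        m = self.momentum
        self.avg_loss[group] = m * self.avg_loss[group] + (1 - m) * loss

    def weight(self, group):
        """Scale factor for a sample: >1 if its group is doing worse than average."""
        mean = sum(self.avg_loss.values()) / len(self.avg_loss)
        return self.avg_loss[group] / mean
```

In a training loop, each sample's loss would be multiplied by `scaler.weight(group)` before backpropagation and then passed to `scaler.update(group, loss)`, so the weighting adapts as group-level performance gaps shrink or grow.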
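The ES-AUC metric is described above only as balancing overall AUC against disparities among groups; its exact scaling is not given in this summary. One plausible formulation, sketched below as an assumption rather than the paper's definition, divides the overall AUC by one plus the summed absolute gaps between each group's AUC and the overall AUC, so larger disparities shrink the score. The helper names `auc` and `es_auc` are illustrative; the AUC itself is computed with the standard Mann-Whitney formulation.

```python
def auc(labels, scores):
    """Mann-Whitney AUC for binary labels (1 = DR, 0 = non-DR)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one sample of each class")
    # Fraction of positive/negative pairs ranked correctly (ties count 0.5).
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def es_auc(labels, scores, groups):
    """Sketch of an equity-scaled AUC: overall AUC penalized by group gaps."""
    overall = auc(labels, scores)
    gap = sum(
        abs(overall - auc([y for y, g in zip(labels, groups) if g == name],
                          [s for s, g in zip(scores, groups) if g == name]))
        for name in set(groups)
    )
    # With no disparity, gap = 0 and ES-AUC equals the overall AUC.
    return overall / (1.0 + gap)
```

Under this formulation, a model with identical per-group AUCs keeps its full overall AUC, while any between-group gap discounts it, which matches the behavior reported in the Findings where ES-AUC sits at or below the overall AUC.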