Importance
Although race is a social construct, it is associated with variations in skin and retinal pigmentation. Image-based medical artificial intelligence (AI) algorithms that use images of these organs have the potential to learn features associated with self-reported race (SRR), which increases the risk of racially biased performance in diagnostic tasks. Understanding whether this information can be removed without affecting the performance of AI algorithms is critical to reducing the risk of racial bias in medical AI.

Objective
To evaluate whether converting color fundus photographs of infants screened for retinopathy of prematurity (ROP) to retinal vessel maps (RVMs) removes the risk for racial bias.

Design, Setting, and Participants
Retinal fundus images (RFIs) from neonates with parent-reported Black or White race were collected for this study. A U-Net, a convolutional neural network (CNN) architecture designed for precise segmentation of biomedical images, was used to segment the major arteries and veins in RFIs into grayscale RVMs, which were subsequently thresholded, binarized, and/or skeletonized. CNNs were then trained to predict patients' SRR labels from color RFIs, raw RVMs, and thresholded, binarized, or skeletonized RVMs. Study data were analyzed from July 1 to September 28, 2021.

Main Outcomes and Measures
Area under the precision-recall curve (AUC-PR) and area under the receiver operating characteristic curve (AUROC), at both the image and eye level, for classification of SRR.

Results
A total of 4095 RFIs were collected from 245 neonates with parent-reported Black (94 [38.4%]; mean [SD] age, 27.2 [2.3] weeks; 55 majority sex [58.5%]) or White (151 [61.6%]; mean [SD] age, 27.6 [2.3] weeks; 80 majority sex [53.0%]) race. CNNs inferred SRR from RFIs nearly perfectly (image-level AUC-PR, 0.999; 95% CI, 0.999-1.000; infant-level AUC-PR, 1.000; 95% CI, 0.999-1.000). Raw RVMs were nearly as informative as color RFIs (image-level AUC-PR, 0.938; 95% CI, 0.926-0.950; infant-level AUC-PR, 0.995; 95% CI, 0.992-0.998). Ultimately, CNNs learned whether RFIs or RVMs came from Black or White infants regardless of whether the images contained color, whether brightness differences in the vessel segmentations were nullified, or whether vessel segmentation widths were made uniform.

Conclusions and Relevance
Results of this diagnostic study suggest that it can be very challenging to remove information relevant to SRR from fundus photographs. As a result, AI algorithms trained on fundus photographs have the potential for biased performance in practice, even if they are based on biomarkers rather than raw images. Regardless of the methodology used to train AI, evaluating performance in relevant subpopulations is critical.
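
The abstract summarizes an image-processing and evaluation pipeline without code. The sketch below is a minimal, illustrative reconstruction of two of those steps, not the authors' implementation: deriving thresholded, binarized, and skeletonized variants of a grayscale RVM, and computing AUC-PR and AUROC at the image level and after per-subject aggregation. The Otsu thresholding rule, the mean-score pooling, and all function names are assumptions, not details taken from the study.

```python
# Minimal illustrative sketch of the RVM post-processing and evaluation steps
# described in the abstract. Not the study's code; the thresholding rule,
# aggregation strategy, and function names are assumptions.
import numpy as np
from skimage.filters import threshold_otsu
from skimage.morphology import skeletonize
from sklearn.metrics import average_precision_score, roc_auc_score


def postprocess_rvm(rvm_gray, threshold=None):
    """Return thresholded, binarized, and skeletonized versions of a
    grayscale retinal vessel map with intensities in [0, 1]."""
    if threshold is None:
        threshold = threshold_otsu(rvm_gray)  # assumed rule; not specified in the abstract
    # Thresholded: keep grayscale brightness above the cutoff.
    thresholded = np.where(rvm_gray >= threshold, rvm_gray, 0.0)
    # Binarized: nullify brightness differences (vessel vs background only).
    binarized = (rvm_gray >= threshold).astype(np.uint8)
    # Skeletonized: reduce vessels to 1-pixel centerlines (uniform width).
    skeleton = skeletonize(binarized.astype(bool)).astype(np.uint8)
    return thresholded, binarized, skeleton


def srr_classification_metrics(y_true, y_score, subject_ids):
    """AUC-PR and AUROC at the image level, then again after averaging
    per-image scores within each subject (eye- or infant-level analysis)."""
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score)
    subject_ids = np.asarray(subject_ids)

    image_level = {
        "auc_pr": average_precision_score(y_true, y_score),
        "auroc": roc_auc_score(y_true, y_score),
    }

    subj_labels, subj_scores = [], []
    for sid in np.unique(subject_ids):
        mask = subject_ids == sid
        subj_labels.append(int(y_true[mask].max()))      # one SRR label per subject
        subj_scores.append(float(y_score[mask].mean()))  # mean of per-image scores
    subject_level = {
        "auc_pr": average_precision_score(subj_labels, subj_scores),
        "auroc": roc_auc_score(subj_labels, subj_scores),
    }
    return image_level, subject_level
```

In this sketch, average_precision_score serves as the usual practical estimate of the area under the precision-recall curve; the abstract does not describe how image-level predictions were pooled to the eye or infant level, so the mean-score pooling here is only one plausible choice.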