2024
DOI: 10.1101/2024.02.12.24302676
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Learning for Multi-Label Disease Classification of Retinal Images: Insights from Brazilian Data for AI Development in Lower-Middle Income Countries

Dewi S.W. Gould,
Jenny Yang,
David A. Clifton

Abstract: Retinal fundus imaging is a powerful tool for disease screening and diagnosis in opthalmology. With the advent of machine learning and artificial intelligence, in particular modern computer vision classification algorithms, there is broad scope for technology to improve accuracy, increase accessibility and reduce cost in these processes. In this paper we present the first deep learning model trained on the first Brazilian multi-label opthalmological datatset. We train a multi-label classifier using over 16,000… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…The employment of advanced deep learning techniques, such as ConvNext V2, further underscores the dataset’s utility [ 35 ] and has yielded notable advancements in the field of retina fundus photo analysis. The results showcase the superiority of the architecture over the previously established benchmarks, primarily those set by ResNet 50 in the classification tasks of binary diabetic retinopathy and three-class diabetic retinopathy states [ 39 , 41 ].…”
Section: Discussionmentioning
confidence: 75%
See 1 more Smart Citation
“…The employment of advanced deep learning techniques, such as ConvNext V2, further underscores the dataset’s utility [ 35 ] and has yielded notable advancements in the field of retina fundus photo analysis. The results showcase the superiority of the architecture over the previously established benchmarks, primarily those set by ResNet 50 in the classification tasks of binary diabetic retinopathy and three-class diabetic retinopathy states [ 39 , 41 ].…”
Section: Discussionmentioning
confidence: 75%
“…We opted for ConvNext V2, diverging from the conventional use of ResNet-50 used in prior research on the BRSET dataset [ 39 41 ]. The reason for using ConvNext V2 instead of ResNet-50, is that ConvNext V2 is an evolution of the ResNet-50 architecture, and introduces several enhancements over its predecessors, such as layer normalization, expanded kernel sizes, or regularization techniques [ 35 , 42 ].…”
Section: Methodsmentioning
confidence: 99%
“…It leverages self-supervised pre-training on large-scale datasets, a multi-crop strategy, attention mechanisms, and scalability to learn rich and generalizable visual features. These architectures offer advantages over their predecessors, such as ResNets and the original ViT, which have been widely used in prior retinal imaging research [32][33][34] .…”
Section: Network Frameworkmentioning
confidence: 99%