2022
DOI: 10.3390/mi13060947
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Label Fundus Image Classification Using Attention Mechanisms and Feature Fusion

Abstract: Fundus diseases can cause irreversible vision loss in both eyes if not diagnosed and treated immediately. Due to the complexity of fundus diseases, the probability of fundus images containing two or more diseases is extremely high, while existing deep learning-based fundus image classification algorithms have low diagnostic accuracy in multi-labeled fundus images. In this paper, a multi-label classification of fundus disease with binocular fundus images is presented, using a neural network algorithm model base… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(3 citation statements)
references
References 29 publications
0
3
0
Order By: Relevance
“…In extreme cases, if the identity mapping is optimal, the network can use an easier way to construct the identity mapping, pushing the residuals F(x) = H(x) − x to zero, which is easier than fitting the identity mapping with multiple non-linear layers. Therefore, the gradient flows directly through these connections, reducing the disappearance or explosion of the gradient, and the training of the deep learning network becomes easier [ 36 , 37 , 38 ].…”
Section: Construction Of the Residual Network Modelmentioning
confidence: 99%
“…In extreme cases, if the identity mapping is optimal, the network can use an easier way to construct the identity mapping, pushing the residuals F(x) = H(x) − x to zero, which is easier than fitting the identity mapping with multiple non-linear layers. Therefore, the gradient flows directly through these connections, reducing the disappearance or explosion of the gradient, and the training of the deep learning network becomes easier [ 36 , 37 , 38 ].…”
Section: Construction Of the Residual Network Modelmentioning
confidence: 99%
“…Although the existing methods have achieved good results in extracting fundus lesion features [ 27 ], the data volume still affects the classification performance of the network, and the classification effect of the network cannot be visually analyzed. Different from the above methods, this paper proposes a data enhancement method guided by Grad-CAM visual attention based on the integrated neural network, which amplifies the fundus image dataset in a targeted manner, helps the model learn rich subtle features, and improves recognition accuracy.…”
Section: Related Workmentioning
confidence: 99%
“…We demonstrated in this paper that the right choice of filter significantly impacts the accuracy of the training model. Table 4 compares the accuracy obtained from our method with the results obtained in [17], [28], [29]. These papers investigate the multiclassification problem using the same ODIR dataset we chose for our experiments.…”
Section: Recall Tp Tp Fnmentioning
confidence: 99%