2023
DOI: 10.1038/s41598-023-30853-z
|View full text |Cite
|
Sign up to set email alerts
|

An interpretable transformer network for the retinal disease classification using optical coherence tomography

Abstract: Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(13 citation statements)
references
References 40 publications
0
13
0
Order By: Relevance
“…This approach combines cross convolution with a transformer structure that includes boundary regression and feature polarization, complemented by a CNN and a modified lightweight transformer structure, enhancing the performance of retinal layer segmentation. Jingzhen He et al 28 proposed the SwinPoly Transformer network, which builds connections between adjacent windows in the previous layer, boosting information exchange and allowing segmentation with multi-scale features.…”
Section: Transformers For Retinal Layers Segmentationmentioning
confidence: 99%
“…This approach combines cross convolution with a transformer structure that includes boundary regression and feature polarization, complemented by a CNN and a modified lightweight transformer structure, enhancing the performance of retinal layer segmentation. Jingzhen He et al 28 proposed the SwinPoly Transformer network, which builds connections between adjacent windows in the previous layer, boosting information exchange and allowing segmentation with multi-scale features.…”
Section: Transformers For Retinal Layers Segmentationmentioning
confidence: 99%
“…On the APTOS2019 dataset, their suggested technique produced a classification accuracy of 87.43%. A Swin Transformer has been suggested by He et al 21 for the classification of retinal OCT images. The Swin transformer identifies the connection between adjacent, non‐overlapping image patches.…”
Section: Related Workmentioning
confidence: 99%
“…This shifted window method results in better efficiency by focusing self-attention on small, local groups of patches. He et al [ 10 ] proposed a network that combined Swin-Transformer and PolyLoss [ 24 ] to diagnose several retinal diseases in OCT images automatically. To visually understand the decision-making ability of the model, they create Class Activation Mappings (CAMs) using the score-CAM [ 25 ] method.…”
Section: Literature Reviewmentioning
confidence: 99%
“…However, these transformer models must be pre-trained on enormous amounts of data to achieve competitive results. Several recent works [ 9 , 10 ] explore the effectiveness of modern transformer architectures based on ViT in diagnosing retinal diseases.…”
Section: Introductionmentioning
confidence: 99%