Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475418
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Modal Multi-Instance Learning for Retinal Disease Recognition

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 28 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…However, this creates representational bottlenecks because the demonstration's overall dimensionality decreases to l22k$$ {\left(\frac{l}{2}\right)}^2k $$ minus communicative network. Instead of using this, another option is suggested that decreases the calculation rate even more by reducing the representative bottleneck 2 …”
Section: Proposed Methodologymentioning
confidence: 99%
See 1 more Smart Citation
“…However, this creates representational bottlenecks because the demonstration's overall dimensionality decreases to l22k$$ {\left(\frac{l}{2}\right)}^2k $$ minus communicative network. Instead of using this, another option is suggested that decreases the calculation rate even more by reducing the representative bottleneck 2 …”
Section: Proposed Methodologymentioning
confidence: 99%
“…Retinitis pigmentosa, cataract, ocular surface neoplasia, age‐related macular degeneration (AMD), glaucoma, diabetic retinopathy (DR), and pterygium are the most common retinal illnesses. Aging, smoking, diabetes, obesity, cardiovascular disease, genetics, and other factors all contribute to an increased risk of progression and development of these diseases 2 . The majority of ocular illnesses affect both eyes, with 80% of all causes of visual impairment curable if detected earlier.…”
Section: Introductionmentioning
confidence: 99%
“…The success of deep learning based visual recognition in various applications has stimulated interest in solving retinal disease classification tasks from multi-modal images [ 14 , 15 , 19 23 ]. Yoo et al [ 20 ] employed random forests and VGG networks to extract features from OCT and CFP images, and then experimented with feature concatenation to aid in the multi-modal image diagnosis of AMD.…”
Section: Related Workmentioning
confidence: 99%
“…Chen et al [ 22 ] designed a vertical plane feature fusion method for multi-modal fusion to predict AMD disease use infrared reflectance and OCT images. Li et al [ 15 ] proposed a multi-modal multi-instance Learning deep learning framework using CFP and OCT, selectively fusing CFP and OCT modalities feature. Song et al [ 23 ] developed a multi-modal information bottleneck network (MMIB-Net) leveraging information bottleneck theory for feature representation in multiple modalities.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, deep learning-based methods, such as image classification, objection detection and semantic segmentation, have been applied successfully on OCT images to solve various tasks including retinal layer segmentation [1,5], lesion localization [3] and multimodal retinal disease recognition [8,17,18], to name just a few. As these methods follow a standard supervised learning paradigm, a considerable amount of labeled data is often required.…”
Section: Introductionmentioning
confidence: 99%