2021
DOI: 10.48550/arxiv.2105.11087
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Recent Advances and Trends in Multimodal Deep Learning: A Review

Abstract: Deep Learning has implemented a wide range of applications and has become increasingly popular in recent years. The goal of multimodal deep learning is to create models that can process and link information using various modalities. Despite the extensive development made for unimodal learning, it still cannot cover all the aspects of human learning. Multimodal learning helps to understand and analyze better when various senses are engaged in the processing of information. This paper focuses on multiple types o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 16 publications
(18 citation statements)
references
References 129 publications
0
18
0
Order By: Relevance
“…Van Klink et al (Van Klink et al, 2022) recently highlighted how deep learning for ecology has been well represented in four distinct modalities: computer vision, acoustics, radar, and molecular methods. Recent successes in deep learning research have shown training models that utilize a combination of these representations can improve performances over a single modality, especially for fine-grained classification tasks (Morgado et al, 2021;Stahlschmidt et al, 2022;Summaira et al, 2021). We believe there are vast numbers of research directions to explore considering multimodal ecological data.…”
Section: Multi-modality Learningmentioning
confidence: 99%
“…Van Klink et al (Van Klink et al, 2022) recently highlighted how deep learning for ecology has been well represented in four distinct modalities: computer vision, acoustics, radar, and molecular methods. Recent successes in deep learning research have shown training models that utilize a combination of these representations can improve performances over a single modality, especially for fine-grained classification tasks (Morgado et al, 2021;Stahlschmidt et al, 2022;Summaira et al, 2021). We believe there are vast numbers of research directions to explore considering multimodal ecological data.…”
Section: Multi-modality Learningmentioning
confidence: 99%
“…To solve this problem, Vasiljević et al proposed an unsupervised learning algorithm to achieve “image translation” between different staining methods, which has proven useful in overcoming the barriers of inter-staining fusion [ 121 ]. Multi-modal learning, which refers to the combination of different sources of information [ 122 ], including pathological results, images, clinical history, and biochemical test results, has gained early success. An automated workflow was recently developed for integrating multi-modal data of mouse models of pancreatic cancer, which can transfer annotations between histology data and MSI data [ 123 ].…”
Section: Outlook For the Futurementioning
confidence: 99%
“…van Klink et al [17] recently highlighted how deep learning for ecology has been well represented in four distinct modalities: computer vision, acoustics, radar, and molecular methods. Recent successes in deep learning research have shown training models that utilize a combination of these representations can improve performances over a single modality, especially for fine-grained classification tasks [85,86,87]. We believe there are vast numbers of research directions to explore considering multimodal ecological data.…”
Section: Focus On Ai and Ecology Moving Forwardmentioning
confidence: 99%