2021
DOI: 10.1109/access.2021.3070212
|View full text |Cite
|
Sign up to set email alerts
|

A Review on Explainability in Multimodal Deep Neural Nets

Abstract: Artificial Intelligence techniques powered by deep neural nets have achieved much success in several application domains, most significantly and notably in the Computer Vision applications and Natural Language Processing tasks. Surpassing human-level performance propelled the research in the applications where different modalities amongst language, vision, sensory, text play an important role in accurate predictions and identification. Several multimodal fusion methods employing deep learning models are propos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
37
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3
1

Relationship

1
9

Authors

Journals

citations
Cited by 122 publications
(50 citation statements)
references
References 177 publications
0
37
0
Order By: Relevance
“…The factor of stress affecting any person should not be ignored for too long as it causes health issues, both mental and physical. Furthermore, using the power of artificial intelligence in healthcare that goes beyond our supervision will go a long way [ 46 ].…”
Section: Discussionmentioning
confidence: 99%
“…The factor of stress affecting any person should not be ignored for too long as it causes health issues, both mental and physical. Furthermore, using the power of artificial intelligence in healthcare that goes beyond our supervision will go a long way [ 46 ].…”
Section: Discussionmentioning
confidence: 99%
“…Besides, this work lacked in applying some of the techniques such as progressive resizing ( Bhatt, Ganatra & Kotecha, 2021a ), which can be applied to CNNs to carry out imaging-based diagnostics. Furthermore, visual ablation studies ( Bhatt, Ganatra & Kotecha, 2021b ; Joshi, Walambe & Kotecha, 2021 ; Gite et al, 2021 ) can be performed along with deep learning, which will significantly improve the detection of COVID-19 manifestations in the CXR images. Since only a limited number of CXR images are available for COVID-19 infection, out-of-distribution issues may arise, so more data from related distributions is needed for further evaluation.…”
Section: Discussionmentioning
confidence: 99%
“…The advantage of multimodality is its ability to extract and combine critical and comprehensive information from a variety of sources, thereby allowing for a far richer representation of the problem at hand. Several applications of multimodal AI were proposed in the literature [215], [216], but despite their outstanding performance, there is a lack of social acceptance due to their black-box nature [217]. Recently, some research was conducted for explaining multimodal AI systems [178], [218]- [222].…”
Section: Multimodal Xaimentioning
confidence: 99%