2023
DOI: 10.1109/access.2023.3335216

A Critical Analysis of Benchmarks, Techniques, and Models in Medical Visual Question Answering

Suheer Al-Hadhrami,
Mohamed El Bachir Menai,
Saad Al-Ahmadi
et al.

Abstract: This paper comprehensively reviews medical VQA models, structures, and datasets, focusing on combining vision and language. Over 75 models were compared, together with their statistical and SWOT (Strengths, Weaknesses, Opportunities, Threats) analyses. The study highlights whether researchers in the general field influence those in the medical field. According to an analysis of text encoding techniques, LSTM is the most widely used approach (42%), followed by non-text methods (14%) and BiLSTM (1…

Cited by 1 publication
References 182 publications (280 reference statements)