International Journal of Intelligent Systems and Applications in Engineering 2022
DOI: 10.17762/ijisae.v10i3s.2427
|View full text |Cite
|
Sign up to set email alerts
|

BERT based Hierarchical Alternating Co-Attention Visual Question Answering using Bottom-Up Features

Abstract: Answering a question from a given visual image is a very well-known vision language task where the machine is given a pair of an image and a related question and the task is to generate the natural language answer. Humans can easily relate image content with a given question and reason about how to generate an answer. But automation of this task is challenging as it involves many computer vision and NLP tasks. Most of the literature focus on a novel attention mechanism for joining image and question features i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 22 publications
0
2
0
Order By: Relevance
“…Most of the recent work uses transformers for question featurization. [25,26] utilized transformer BERT for language feature extraction. In [87] authors concatenated the output of four consecutive BERT layers in order to generate hierarchical features from the question.…”
Section: Methods Papermentioning
confidence: 99%
See 1 more Smart Citation
“…Most of the recent work uses transformers for question featurization. [25,26] utilized transformer BERT for language feature extraction. In [87] authors concatenated the output of four consecutive BERT layers in order to generate hierarchical features from the question.…”
Section: Methods Papermentioning
confidence: 99%
“…VilBERT [13],LXMERT [14], UNITER [15], Oscar [16], Coarse to fine reasoning [23], MPC [25], Hie-Alternation coattention [26], Rich Image region VQA [27], KRISP [50] Source: Own elaboration.…”
Section: Grumentioning
confidence: 99%