2022
DOI: 10.1007/s11063-021-10689-2
|View full text |Cite
|
Sign up to set email alerts
|

Research on Visual Question Answering Based on GAT Relational Reasoning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…At present, AI is developing rapidly. As one of the applications of intelligent systems, VQA has attracted more Chinese researchers and scientists, focusing on this frontier field and promoting research progress in related fields ( Guo & Han, 2022 ; Guo & Han, 2023 ; Miao et al, 2022b ; Miao et al, 2022a ; Peng et al, 2022a ; Shen et al, 2022 ; Liu et al, 2022a ).…”
Section: Survey Methodologymentioning
confidence: 99%
“…At present, AI is developing rapidly. As one of the applications of intelligent systems, VQA has attracted more Chinese researchers and scientists, focusing on this frontier field and promoting research progress in related fields ( Guo & Han, 2022 ; Guo & Han, 2023 ; Miao et al, 2022b ; Miao et al, 2022a ; Peng et al, 2022a ; Shen et al, 2022 ; Liu et al, 2022a ).…”
Section: Survey Methodologymentioning
confidence: 99%
“…Cao et al [45] proposed a Graph Matching Attention network that constructs a graph from the input image and question and infers the relationships between different modalities using cross-modality graph matching attention. Miao et al [46] organized an image as a scene graph and used question-guided node attention to construct a dynamic scene graph. Guo et al [47] proposed SCAVQAN, which sets thresholds for attention scores to filter out features that are not helpful for predicting the answer.…”
Section: Multimodal Interactionmentioning
confidence: 99%
“…To address the question that visual question and answer models neglect the modeling of object relationships in image, Cheng [42] proposed the relational reasoning mode Graph Attention Network Relation Reasoning(GAT2R) by introducing a graph attention mechanism. GAT2R model includes question feature extraction, scene graph generation, scene graph update part, multimodal fusion and answer prediction.…”
Section: B Attentional Mechanismsmentioning
confidence: 99%