2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.11
|View full text |Cite
|
Sign up to set email alerts
|

Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction

Abstract: We tackle image question answering (ImageQA) problem by learning a convolutional neural network (CNN) with a dynamic parameter layer whose weights are determined adaptively based on questions. For the adaptive parameter prediction, we employ a separate parameter prediction network, which consists of gated recurrent unit (GRU) taking a question as its input and a fully-connected layer generating a set of candidate weights as its output. However, it is challenging to construct a parameter prediction network for … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
179
0
1

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 281 publications
(180 citation statements)
references
References 17 publications
0
179
0
1
Order By: Relevance
“…The task of Visual question answering [7], [8], [9], [10], [11] is well studied in the vision and language community, but it has been relatively less explored for providing explanation [3] arXiv:2002.10309v1 [cs.CV] 23 Jan 2020 for answer prediction. Recently, lot of works that focus on explanation models, one of that is image captioning for basic explanation of an image [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22].…”
Section: Related Workmentioning
confidence: 99%
“…The task of Visual question answering [7], [8], [9], [10], [11] is well studied in the vision and language community, but it has been relatively less explored for providing explanation [3] arXiv:2002.10309v1 [cs.CV] 23 Jan 2020 for answer prediction. Recently, lot of works that focus on explanation models, one of that is image captioning for basic explanation of an image [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22].…”
Section: Related Workmentioning
confidence: 99%
“…There are other methods besides CNN to implement VQA task. Noh et al [42] used an independent parametric predictive network with a GRU with the question as input and a fully connected layer generating as output. By combining hashing techniques, they reduced the complexity of constructing a parameter prediction network with a large number of parameters.…”
Section: ) Visual Question Answeringmentioning
confidence: 99%
“…Another proposed solution we identified is a dynamic parameter neural network whose parameters are determined adaptively based on input questions [34]. In this way the system reasons differently for each question.…”
Section: Identifying Clues In Image And/or Question To Generate Answermentioning
confidence: 99%