Proceedings of the 26th ACM International Conference on Multimedia 2018
DOI: 10.1145/3240508.3240563
|View full text |Cite
|
Sign up to set email alerts
|

Explore Multi-Step Reasoning in Video Question Answering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
36
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 49 publications
(36 citation statements)
references
References 25 publications
0
36
0
Order By: Relevance
“…Earlier efforts (Andreas et al, 2016; designed questions that are expressed as elementary operation programs. More related to our work, Song et al (2018);Yi* et al (2020) extended the prior work to the video domain with questions focusing on the temporal variance of video frames.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Earlier efforts (Andreas et al, 2016; designed questions that are expressed as elementary operation programs. More related to our work, Song et al (2018);Yi* et al (2020) extended the prior work to the video domain with questions focusing on the temporal variance of video frames.…”
Section: Related Workmentioning
confidence: 99%
“…In (Antol et al, 2015), Visual7W (Zhu et al, 2016 TGIF-QA (Jang et al, 2017), TV-QA (Lei et al, 2018) IQA (Gordon et al, 2018), EQA (Wijmans et al, 2019) Image/video grounded dialogues, navigation dialogues VisDial ), GuessWhat (De Vries et al, 2017 AVSD , CVDN (Thomason et al, 2019) Synthetic image/video QA SHAPE (Andreas et al, 2016), CLEVR SVQA (Song et al, 2018), CLEVRER (Yi* et al, 2020) Synthetic dialogues bAbI (Bordes et al, 2017) MNIST Dialog (Seo et al, 2017), CLEVR-Dialog (Kottur et al, 2019)…”
Section: Dvd Question Types Sub-types and Examplesmentioning
confidence: 99%
“…We evaluate our proposed architecture on the three recent benchmarks, namely, MSVD-QA, MSRVTT-QA [30], [31] and SVQA [26].…”
Section: Experiments a Datasetsmentioning
confidence: 99%
“…We train our models with Adam optimizer with an initial learning rate of 10 −4 and a batch size of 64. To be compatible with related works [20], [26], [19], we use accuracy as evaluation metric for all tasks.…”
Section: Modelmentioning
confidence: 99%
See 1 more Smart Citation