2019
DOI: 10.1109/tip.2019.2902106
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Turn Video Question Answering via Hierarchical Attention Context Reinforced Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 31 publications
(6 citation statements)
references
References 34 publications
0
6
0
Order By: Relevance
“…This improves customer service operations and improves the user experience while freeing up human agents to handle more complicated requests (Gao et al, 2019). while also providing sellers with new opportunities to showcase their products to a broader audience (Zhao et al, 2017).…”
Section: Review On the Ai-enhanced Online Marketplacesmentioning
confidence: 99%
“…This improves customer service operations and improves the user experience while freeing up human agents to handle more complicated requests (Gao et al, 2019). while also providing sellers with new opportunities to showcase their products to a broader audience (Zhao et al, 2017).…”
Section: Review On the Ai-enhanced Online Marketplacesmentioning
confidence: 99%
“…Therefore, Yang et al [65] develop a novel set generation model based on RL, which not only captures the correlations between items, but also reduces the dependence on the item order. Zhao et al [75,76] propose to utilize reinforcement learning for video question answering.…”
Section: Drugsmentioning
confidence: 99%
“…Jang et al [38], [39] released a large-scale VideoQA dataset named TGIF-QA and proposed a dual-LSTM based method with both spatial and temporal attention. Later on, some hierarchical attention and co-attention based methods [16], [40], [41], [42], [43] are proposed to learn appearance-motion and question-related multi-modal interactions. Le et al [17] proposed hierarchical conditional relation network (HCRN) to construct sophisticated structures for representation and reasoning over videos.…”
Section: Visual Question Answeringmentioning
confidence: 99%