Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413668
|View full text |Cite
|
Sign up to set email alerts
|

Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue

Abstract: A goal-oriented visual dialogue involves multi-turn interactions between two agents, Questioner and Oracle. During which, the answer given by Oracle is of great significance, as it provides golden response to what Questioner concerns. Based on the answer, Questioner updates its belief on target visual content and further raises another question. Notably, different answers drive into different visual beliefs and future questions. However, existing methods always indiscriminately encode answers after much longer… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 20 publications
0
1
0
Order By: Relevance
“…Works [1,2,23,30,34] that studies using RL to improve goal-oriented VD in the Guesswhat?! task setting [9] are also related.…”
Section: Related Workmentioning
confidence: 99%
“…Works [1,2,23,30,34] that studies using RL to improve goal-oriented VD in the Guesswhat?! task setting [9] are also related.…”
Section: Related Workmentioning
confidence: 99%