Findings of the Association for Computational Linguistics: EMNLP 2020 2020
DOI: 10.18653/v1/2020.findings-emnlp.210
|View full text |Cite
|
Sign up to set email alerts
|

MultiDM-GCN: Aspect-guided Response Generation in Multi-domain Multi-modal Dialogue System using Graph Convolutional Network

Abstract: In the recent past, dialogue systems have gained immense popularity and have become ubiquitous. During conversations, humans not only rely on languages but seek contextual information through visual contents as well. In every task-oriented dialogue system, the user is guided by the different aspects of a product or service that regulates the conversation towards selecting the product or service. In this work, we present a multi-modal conversational framework for a task-oriented dialogue setup that generates th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…Other businesses and tasks built on the above interactions include automated identification of interactions and communications and response generation for autonomous, open-domain, context-aware, multi-turn, multi-level, multilingual and knowledge-based dialogue, interactions and communications. Examples are [4,17,36,38]:…”
Section: The Smart Fintech Ecosystemmentioning
confidence: 99%
“…Other businesses and tasks built on the above interactions include automated identification of interactions and communications and response generation for autonomous, open-domain, context-aware, multi-turn, multi-level, multilingual and knowledge-based dialogue, interactions and communications. Examples are [4,17,36,38]:…”
Section: The Smart Fintech Ecosystemmentioning
confidence: 99%
“…Currently, M. Firdaus et al [26] and W. Liu et al [27] think that dialogue reading comprehension is mainly used in dialogue systems. W. Wei et al [28] think dialogue reading comprehension is also used for intelligent human-computer interaction systems.…”
Section: B Transformers For Learning Dialoguementioning
confidence: 99%
“…However, these works focus on the QA style visual dialog, rather than the conversation style with which we are more concerned. Another strand of works address multi-turn dialog generation grounded with vision [2,11,18,47,63]. [56] studied the task of image-grounded conversations where utterances are generated about a shared image.…”
Section: Jointly Modeling Visual and Textual Informationmentioning
confidence: 99%
“…Different from previous datasets that focus on limited-domain multi-modal conversations such as E-commerce [18,63] or communications grounded in a single image [68], the recently released large open-domain multi-modal dataset OpenViDial [54] contains millions of dialog turns, with each dialog turn associated with its specific visual context (image), in which the dialog utterance takes place, rather than a single image for the whole episode. This better mimics multi-modal dialog generation in the real world.…”
Section: The Openvidial Dataset and Task Statementmentioning
confidence: 99%