2020
DOI: 10.48550/arxiv.2005.00796
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Simple Language Model for Task-Oriented Dialogue

Abstract: Task-oriented dialogue is often decomposed into three tasks: understanding user input, deciding actions, and generating a response. While such decomposition might suggest a dedicated model for each sub-task, we find a simple, unified approach leads to state-of-the-art performance on the MultiWOZ dataset. SimpleTOD is a simple approach to task-oriented dialogue that uses a single causal language model trained on all sub-tasks recast as a single sequence prediction problem. This allows SimpleTOD to fully leverag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
56
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 25 publications
(56 citation statements)
references
References 34 publications
0
56
0
Order By: Relevance
“…Reinforcement learning [18,58,80] and powerful transformers such as GPT-2 [29,49] have been used to optimize the dialog system. These methods typically require a large task-specific corpus to train policy over conversational actions, while our method only needs few interactions with humans for a new task, and our policies can be extended to actions beyond conversational ones.…”
Section: Natural Language Processing: Dialog Systemsmentioning
confidence: 99%
“…Reinforcement learning [18,58,80] and powerful transformers such as GPT-2 [29,49] have been used to optimize the dialog system. These methods typically require a large task-specific corpus to train policy over conversational actions, while our method only needs few interactions with humans for a new task, and our policies can be extended to actions beyond conversational ones.…”
Section: Natural Language Processing: Dialog Systemsmentioning
confidence: 99%
“…Annotation error Even the recent versions of MultiWOZ still have incorrect labels and inconsistent annotations [3,21,4,20,5]. These noises are the primary reason why it is challenging to accurately evaluate the model performance.…”
Section: Data Limitationmentioning
confidence: 99%
“…These systems are the core modules of virtual assistants (e.g., Apple Siri and Amazon Alexa), and they provide natural language interfaces for online services [1]. Recently, there has been growing interest in developing deep learning-based end-to-end ToD systems [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16] because they can handle complex dialogue patterns with minimal hand-crafted rules. To advance the existing state-of-the-art, large-scale datasets [17,1,16] have been proposed for training and evaluating such data-driven systems.…”
Section: Introductionmentioning
confidence: 99%