Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-1459
|View full text |Cite
|
Sign up to set email alerts
|

Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset

Abstract: A significant barrier to progress in data-driven approaches to building dialog systems is the lack of high quality, goal-oriented conversational data. To help satisfy this elementary requirement, we introduce the initial release of the Taskmaster-1 dataset which includes 13,215 task-based dialogs comprising six domains. Two procedures were used to create this collection, each with unique advantages. The first involves a two-person, spoken "Wizard of Oz" (WOz) approach in which trained agents and crowdsourced w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
154
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4
2

Relationship

1
9

Authors

Journals

citations
Cited by 135 publications
(156 citation statements)
references
References 22 publications
2
154
0
Order By: Relevance
“…Similar in size and content to MultiWOZ is Taskmaster-1 task-based dialogue dataset (Byrne et al 2019). It includes around 13K dialogues in six domains: ordering pizza, setting auto repair appointments, arranging taxi services, ordering movie tickets, ordering coffee drinks and making restaurant reservations.…”
Section: Datasets For Task-oriented Dialogue Systemsmentioning
confidence: 99%
“…Similar in size and content to MultiWOZ is Taskmaster-1 task-based dialogue dataset (Byrne et al 2019). It includes around 13K dialogues in six domains: ordering pizza, setting auto repair appointments, arranging taxi services, ordering movie tickets, ordering coffee drinks and making restaurant reservations.…”
Section: Datasets For Task-oriented Dialogue Systemsmentioning
confidence: 99%
“…Some datasets contain singledomain conversations [48,51,67,68]. With the increasing demands to handle various tasks in real-world applications, some large-scale multi-domain corpora [69][70][71] have been collected recently. These datasets have higher language variation and task complexity.…”
Section: Corporamentioning
confidence: 99%
“…To foster research on dialog policy learning for virtual digital assistants, several task-oriented dialog corpora have been introduced in recent years, such as SimDial (Zhao and Eskenazi, 2018), Multi-WoZ (Budzianowski et al, 2018), Taskmaster (Byrne et al, 2019), and Schema Guided Dialog (Rastogi et al, 2019), to name a few. Deep learning approaches, including mixture models (Pei et al, 2019) hierarchical encoder/decoder Chen et al, 2019), reinforcement learning (Zhao et al, 2019), and pre-trained language models (Wu et al, 2019;Peng et al, 2020;Hosseini-Asl et al, 2020), have significantly advanced dialog policy research in the past few years , setting new state-of-the-art performance limits.…”
Section: Introductionmentioning
confidence: 99%