Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset

Byrne, Bill; Krishnamoorthi, K. S.; Sankar, Chinnadhurai; Neelakantan, Arvind; Goodrich, Ben; Duckworth, Daniel; Yavuz, Semih; Dubey, Amit; Kim, Kyu-Young; Cedilnik, Andy

doi:10.18653/v1/d19-1459

Cited by 135 publications

(156 citation statements)

References 22 publications

Supporting

Mentioning

154

Contrasting

Order By: Relevance

“…Similar in size and content to MultiWOZ is Taskmaster-1 task-based dialogue dataset (Byrne et al 2019). It includes around 13K dialogues in six domains: ordering pizza, setting auto repair appointments, arranging taxi services, ordering movie tickets, ordering coffee drinks and making restaurant reservations.…”

Section: Datasets For Task-oriented Dialogue Systemsmentioning

confidence: 99%

Survey on evaluation methods for dialogue systems

et al. 2020

View full text Add to dashboard Cite

In this paper, we survey the methods and concepts developed for the evaluation of dialogue systems. Evaluation, in and of itself, is a crucial part during the development process. Often, dialogue systems are evaluated by means of human evaluations and questionnaires. However, this tends to be very cost-and time-intensive. Thus, much work has been put into finding methods which allow a reduction in involvement of human labour. In this survey, we present the main concepts and methods. For this, we differentiate between the various classes of dialogue systems (task-oriented, conversational, and question-answering dialogue systems). We cover each class by introducing the main technologies developed for the dialogue systems and then present the evaluation methods regarding that class.

show abstract

Section: Datasets For Task-oriented Dialogue Systemsmentioning

confidence: 99%

Survey on evaluation methods for dialogue systems

et al. 2020

View full text Add to dashboard Cite

show abstract

“…Some datasets contain singledomain conversations [48,51,67,68]. With the increasing demands to handle various tasks in real-world applications, some large-scale multi-domain corpora [69][70][71] have been collected recently. These datasets have higher language variation and task complexity.…”

Section: Corporamentioning

confidence: 99%

Recent advances and challenges in task-oriented dialog systems

Zhang

Takanobu

Huang

et al. 2020

Sci. China Technol. Sci.

129

View full text Add to dashboard Cite

Due to the significance and value in human-computer interaction and natural language processing, task-oriented dialog systems are attracting more and more attention in both academic and industrial communities. In this paper, we survey recent advances and challenges in task-oriented dialog systems. We also discuss three critical topics for task-oriented dialog systems: (1) improving data efficiency to facilitate dialog modeling in low-resource settings, (2) modeling multi-turn dynamics for dialog policy learning to achieve better task-completion performance, and (3) integrating domain ontology knowledge into the dialog model. Besides, we review the recent progresses in dialog evaluation and some widely-used corpora. We believe that this survey, though incomplete, can shed a light on future research in task-oriented dialog systems. task-oriented dialog systems, natural language understanding, dialog policy, dialog state tracking, natural language generation

show abstract

“…To foster research on dialog policy learning for virtual digital assistants, several task-oriented dialog corpora have been introduced in recent years, such as SimDial (Zhao and Eskenazi, 2018), Multi-WoZ (Budzianowski et al, 2018), Taskmaster (Byrne et al, 2019), and Schema Guided Dialog (Rastogi et al, 2019), to name a few. Deep learning approaches, including mixture models (Pei et al, 2019) hierarchical encoder/decoder Chen et al, 2019), reinforcement learning (Zhao et al, 2019), and pre-trained language models (Wu et al, 2019;Peng et al, 2020;Hosseini-Asl et al, 2020), have significantly advanced dialog policy research in the past few years , setting new state-of-the-art performance limits.…”

Section: Introductionmentioning

confidence: 99%

Resource Constrained Dialog Policy Learning Via Differentiable Inductive Logic Programming

Zhou

Beirami

Crook

et al. 2020

Proceedings of the 28th International Conference on Computational Linguistics

View full text Add to dashboard Cite

Motivated by the needs of resource constrained dialog policy learning, we introduce dialog policy via differentiable inductive logic (DILOG). We explore the tasks of one-shot learning and zero-shot domain transfer with DILOG on SimDial and MultiWoZ. Using a single representative dialog from the restaurant domain, we train DILOG on the SimDial dataset and obtain 99+% in-domain test accuracy. We also show that the trained DILOG zero-shot transfers to all other domains with 99+% accuracy, proving the suitability of DILOG to slot-filling dialogs. We further extend our study to the MultiWoZ dataset achieving 90+% inform and success metrics. We also observe that these metrics are not capturing some of the shortcomings of DILOG in terms of false positives, prompting us to measure an auxiliary Action F1 score. We show that DILOG is 100x more data efficient than state-of-the-art neural approaches on MultiWoZ while achieving similar performance metrics. We conclude with a discussion on the strengths and weaknesses of DILOG.

show abstract

Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset

Cited by 135 publications

References 22 publications

Survey on evaluation methods for dialogue systems

Survey on evaluation methods for dialogue systems

Recent advances and challenges in task-oriented dialog systems

Resource Constrained Dialog Policy Learning Via Differentiable Inductive Logic Programming

Contact Info

Product

Resources

About