Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-long.55
|View full text |Cite
|
Sign up to set email alerts
|

TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems

Abstract: We present a data-driven, end-to-end approach to transaction-based dialog systems that performs at near-human levels in terms of verbal response quality and factual grounding accuracy. We show that two essential components of the system produce these results: a sufficiently large and diverse, in-domain labeled dataset, and a neural network-based, pretrained model that generates both verbal responses and API call predictions. In terms of data, we introduce TicketTalk, a movie ticketing dialog dataset with 23,78… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 23 publications
0
8
0
Order By: Relevance
“…The assistant's turn is marked with an "[ASSISTANT]" tag, and the user's turn is marked with a "[USER]" tag. TicketTalk (Byrne et al, 2021) contains conversations on booking movie tickets between an assistant and a user. The assistant's and user's turns are similarly marked as in TM-2.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The assistant's turn is marked with an "[ASSISTANT]" tag, and the user's turn is marked with a "[USER]" tag. TicketTalk (Byrne et al, 2021) contains conversations on booking movie tickets between an assistant and a user. The assistant's and user's turns are similarly marked as in TM-2.…”
Section: Methodsmentioning
confidence: 99%
“…TicketTalk (Byrne et al, 2021) which contains conversations on booking movie tickets between an assistant and a user. The assistant's and user's turned are similarly marked as in TM-2.…”
Section: F Perplexity After Fine-tuningmentioning
confidence: 99%
“…As mentioned in Section 2.2, recent state-of-the-art performance [16,21] on dialogue state tracking is achieved by the reformulation of such a structure prediction task as a text generation task. In addition, Byrne et al [3] investigate the unified Seq2Seq paradigm for action prediction in transaction-based dialogue systems. However, there are more complicated subtasks with diverse paradigms in MG-CRS, varying from multi-label classification, ranking/recommendation, to text generation.…”
Section: Paradigm Shift and Prompt-based Learningmentioning
confidence: 99%
“…Therefore, we use task-oriented dialogue data as an proxy. We choose TaskMaster (Byrne et al, 2019(Byrne et al, , 2021 over other task-oriented datasets such as synthesis datasets (e.g., MultiWOZ (Budzianowski et al, 2018)) and datasets for scheduling (e.g., KVRET (Eric et al, 2017)), because TaskMaster is collected from real human dialogue and contains a more diverse set of scenarios including restaurant suggestions, movie recommendations, flight reservations, and etc.…”
Section: Task-oriented Dialogue Datasetsmentioning
confidence: 99%