2021
DOI: 10.48550/arxiv.2110.06800
Preprint

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Harrison Lee, Raghav Gupta, Abhinav Rastogi, et al.

Abstract: Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling models to support an unlimited number of services without additional data collection or re-training through the use of schemas. Schemas describe service APIs in natural language, which models consume to understand the services they need to support. However, the impact of the choice of language in these schemas on model performance rema…

Cited by 1 publication (2 citation statements)
References 19 publications
“…The test set includes unseen schemas to encourage model generalization. SGD-X is an extension designed to study model robustness to different schema wordings (Lee et al, 2021b).…”
Section: Datasets (mentioning)
confidence: 99%
“…Learning robust models requires diverse datasets that represent real-world challenges. In this sense, several evaluation benchmarks have been published to study TOD systems' robustness (Lee et al, 2021b; Peng et al, 2021b). Data augmentation is a potential solution to the lack of variety in datasets (Campagna et al, 2020; Li et al, 2021a; Aksu et al, 2022), especially for simulating ASR errors (Wang et al, 2020; Tian et al, 2021b).…”
Section: Robustness (mentioning)
confidence: 99%