2021
DOI: 10.48550/arxiv.2111.02574
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues

Abstract: Robust state tracking for task-oriented dialogue systems currently remains restricted to a few popular languages. This paper shows that given a large-scale dialogue data set in one language, we can automatically produce an effective semantic parser for other languages using machine translation. We propose automatic translation of dialogue datasets with alignment to ensure faithful translation of slot values and eliminate costly human supervision used in previous benchmarks. We also propose a new contextual sem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 25 publications
0
6
0
Order By: Relevance
“…Since the dataset for target languages is introduced in this paper, there is only prior work on the Chinese dataset. In Section 7.3, we compare our results to the best previously reported result on RiSAWOZ from Moradshahi et al (2021) that achieved SOTA on the DST subtask using an mBART model, and from Quan et al (2020) for other subtasks which use DAMD , a Seq2Seq RNN end-to-end dialogue model. We use seven widely-used automatic metrics to compare different models.…”
Section: Modelsmentioning
confidence: 99%
See 2 more Smart Citations
“…Since the dataset for target languages is introduced in this paper, there is only prior work on the Chinese dataset. In Section 7.3, we compare our results to the best previously reported result on RiSAWOZ from Moradshahi et al (2021) that achieved SOTA on the DST subtask using an mBART model, and from Quan et al (2020) for other subtasks which use DAMD , a Seq2Seq RNN end-to-end dialogue model. We use seven widely-used automatic metrics to compare different models.…”
Section: Modelsmentioning
confidence: 99%
“…A portion of their evaluation and test sets were post-edited by humans, while the training set remained entirely machine translated. Moradshahi et al (2021) translated RiSAWOZ to English and German using open-source machine translation models with alignment. However, the validation and test data were not verified by humans, resulting in potentially over-estimating the accuracy of agents.…”
Section: Multilingual Dialogue Datasetsmentioning
confidence: 99%
See 1 more Smart Citation
“…Target logical form y i tgt has the same signature as y i eng and only differs in terms of the translated slot values. Most translation based approaches (Moradshahi et al, 2020(Moradshahi et al, , 2021Xia and Monti, 2021;Nicosia et al, 2021) translate an English example (x i eng , y i eng ) to the corresponding target language example (x i tgt , y i tgt ) via a two step process: (i) Translate: Use a supervised translation service to convert the English utterance x i eng into the target language utterance x i tgt ; and (ii) Project: Replace the English slot values in y i eng with spans copied from the translated utterance x i tgt via a learned alignment model. The translated examples are then used to train a downstream multilingual semantic parser.…”
Section: Translating Semantic Parsing Datasetsmentioning
confidence: 99%
“…A common approach to automatic multilingual dataset creation is translating existing English datasets into target languages. Prior methods utilize an off-the-shelf machine translation model for translating the English utterance into the target language x eng → x tgt , followed by projecting language specific components in the English logicalform y eng to obtain the logical-form y tgt in the target language (Moradshahi et al, 2020(Moradshahi et al, , 2021Xia and Monti, 2021;Nicosia et al, 2021;Gritta et al, 2022;. The projection step is often learned independent of the translation service, resulting in poor generalization across languages.…”
Section: Introductionmentioning
confidence: 99%