Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 2021
DOI: 10.18653/v1/2021.findings-acl.264
|View full text |Cite
|
Sign up to set email alerts
|

Language Tags Matter for Zero-Shot Neural Machine Translation

Abstract: Multilingual Neural Machine Translation (MNMT) has aroused widespread interest due to its efficiency. An exciting advantage of MNMT models is that they could also translate between unsupervised (zero-shot) language directions.Language tag (LT) strategies are often adopted to indicate the translation directions in MNMT. In this paper, we demonstrate that the LTs are not only indicators for translation directions but also crucial to zero-shot translation qualities. Unfortunately, previous work tends to ignore th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…This finding disagrees with that of Wang et al (2021), which we ascribe to various differences in model, data and optimization. Note that Wang et al (2021) adopted more aggressive data oversampling, didn't consider distant languages, proposed dedicated optimization with the source-side loss, used a different way to count model parameters, and designed different language tags for multilingual translation that could greatly affect zero-shot results (Wu et al, 2021). We leave the study of these differences to the future.…”
Section: Experiments For Cross-lingual Transfermentioning
confidence: 99%
“…This finding disagrees with that of Wang et al (2021), which we ascribe to various differences in model, data and optimization. Note that Wang et al (2021) adopted more aggressive data oversampling, didn't consider distant languages, proposed dedicated optimization with the source-side loss, used a different way to count model parameters, and designed different language tags for multilingual translation that could greatly affect zero-shot results (Wu et al, 2021). We leave the study of these differences to the future.…”
Section: Experiments For Cross-lingual Transfermentioning
confidence: 99%
“…Raganato et al [32] proposed an auxiliary supervised objective that explicitly aligns the words in language pairs to alleviate the off-target problem of zero-shot translation. Wu et al [33] empirically compared several language tag strategies and recommended adding the target language tag to the encoder for zero-shot translation. Gonzales et al [34] improved the zero-shot translation by performing language-specific subword segmentation and including the non-English pairs in English-centric training data.…”
Section: Zero-shot Neural Machine Translationmentioning
confidence: 99%