Naver Labs Europe’s Systems for the WMT19 Machine Translation Robustness Task

Bérard, Alexandre; Calapodescu, Ioan; Roux, Claude

doi:10.18653/v1/w19-5361

Cited by 42 publications

(64 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Data Cleaning Data cleaning played an important part in training successful MT systems in this campaign. Unlike other participants, the winning team Naver Labs Bérard et al (2019) and NTT (Murakami et al, 2019) applied data cleaning techniques in order to filter noisy parallel sentences. They filtered i) identical sentences on source and target side, ii) sentences that belonged to a language other than the source and target language, iii) sentences with length mismatch, and iv) also applied attention-based filtering.…”

Section: Summary Of Methodsmentioning

confidence: 99%

“…The final submission is an ensemble of 4 models. NaverLabsEurope(NLE)' submission (Bérard et al, 2019): The participants carried substantial effort to clean the CommonCrawl data, applying length filtering (length ratio threshold), language identification-based filtering, and attention based filtering. They used the Transformer-Big architecture for Fra→Eng and Jpn→Eng, and Transformer-Base for the Eng→Jpn direction.…”

Section: Fokus' Submissionmentioning

confidence: 99%

See 1 more Smart Citation

Findings of the First Shared Task on Machine Translation Robustness

Wang¹,

Michel²,

Anastasopoulos³

et al. 2019

Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)

View full text Add to dashboard Cite

We share the findings of the first shared task on improving robustness of Machine Translation (MT). The task provides a testbed representing challenges facing MT models deployed in the real world, and facilitates new approaches to improve models' robustness to noisy input and domain mismatch. We focus on two language pairs (English-French and English-Japanese), and the submitted systems are evaluated on a blind test set consisting of noisy comments on Reddit 1 and professionally sourced translations. As a new task, we received 23 submissions by 11 participating teams from universities, companies, national labs, etc. All submitted systems achieved large improvements over baselines, with the best improvement having +22.33 BLEU. We evaluated submissions by both human judgment and automatic evaluation (BLEU), which shows high correlations (Pearson's r = 0.94 and 0.95). Furthermore, we conducted a qualitative analysis of the submitted systems using compare-mt 2 , which revealed their salient differences in handling challenges in this task. Such analysis provides additional insights when there is occasional disagreement between human judgment and BLEU, e.g. systems better at producing colloquial expressions received higher score from human judgment.

show abstract

Section: Summary Of Methodsmentioning

confidence: 99%

Section: Fokus' Submissionmentioning

confidence: 99%

Findings of the First Shared Task on Machine Translation Robustness

Wang¹,

Michel²,

Anastasopoulos³

et al. 2019

Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)

View full text Add to dashboard Cite

show abstract

“…Architecture We use the Transformer architecture (Vaswani et al, 2017), implemented in fairseq (Ott et al, 2019), which we modify to include monolingual and bilingual adapters. We train a joint BPE model (Sennrich et al, 2016) on all languages, with inline casing (Berard et al, 2019) and 64k merge operations (resulting in a 70k vocabulary size). The Transformer architecture used in this work 3 has 4 attention heads, 6 encoder layers, 6 decoder layers, an embedding size of 512 and a feed-forward dimension of 1024.…”

Section: Trainingmentioning

confidence: 99%

Monolingual Adapters for Zero-Shot Neural Machine Translation

Philip¹,

Bérard²,

Gallé³

et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Self Cite

View full text Add to dashboard Cite

We propose a novel adapter layer formalism for adapting multilingual models. They are more parameter-efficient than existing adapter layers while obtaining as good or better performance. The layers are specific to one language (as opposed to bilingual adapters) allowing to compose them and generalize to unseen language-pairs. In this zero-shot setting, they obtain a median improvement of +2.77 BLEU points over a strong 20-language multilingual Transformer baseline trained on TED talks. * Work done during an internship at NAVER LABS Europe.

show abstract

“…Michel and Neubig (2018) introduced a dataset scraped from Reddit for testing the NMT systems on the noisy text. Recently, a shared task on building the robust NMT models was held Bérard et al, 2019).…”

Section: Related Workmentioning

confidence: 99%

Adversarial Subword Regularization for Robust Neural Machine Translation

Park¹,

Sung²,

Lee³

et al. 2020

Findings of the Association for Computational Linguistics: EMNLP 2020

View full text Add to dashboard Cite

Exposing diverse subword segmentations to neural machine translation (NMT) models often improves the robustness of machine translation as NMT models can experience various subword candidates. However, the diversification of subword segmentations mostly relies on the pre-trained subword language models from which erroneous segmentations of unseen words are less likely to be sampled. In this paper, we present adversarial subword regularization (ADVSR) to study whether gradient signals during training can be a substitute criterion for exposing diverse subword segmentations. We experimentally show that our model-based adversarial samples effectively encourage NMT models to be less sensitive to segmentation errors and improve the performance of NMT models in low-resource and out-domain datasets.

show abstract

Naver Labs Europe’s Systems for the WMT19 Machine Translation Robustness Task

Cited by 42 publications

References 19 publications

Findings of the First Shared Task on Machine Translation Robustness

Findings of the First Shared Task on Machine Translation Robustness

Monolingual Adapters for Zero-Shot Neural Machine Translation

Adversarial Subword Regularization for Robust Neural Machine Translation

Contact Info

Product

Resources

About