2021
DOI: 10.48550/arxiv.2108.03265
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Facebook AI WMT21 News Translation Task Submission

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 0 publications
0
8
0
Order By: Relevance
“…Assessing zero-shot performance in non-English languages is challenging due to the limited number of human-curated benchmarks available. However, with the exception of recent work in machine translation [4], multilingual models generally perform worse than mono-or bilingual language models [5].…”
Section: Introductionmentioning
confidence: 93%
See 1 more Smart Citation
“…Assessing zero-shot performance in non-English languages is challenging due to the limited number of human-curated benchmarks available. However, with the exception of recent work in machine translation [4], multilingual models generally perform worse than mono-or bilingual language models [5].…”
Section: Introductionmentioning
confidence: 93%
“…Each percentile as well as the tails of both the loss and the toxicity distribution were sampled and manually inspected to find suitable cut-off values for filtering. The inspection of these samples revealed that both toxicity and loss values were appropriate 4 . We removed documents corresponding to a toxicity score higher than 0.5, corresponding to 0.25% of the content (0.8M documents).…”
Section: Training Datamentioning
confidence: 99%
“…Neural machine translation (NMT) (Bahdanau et al, 2014;Cho et al, 2014) tackles this problem by modeling translation as an end to end process using neural networks. In current machine translation research, the Transformer architecture (Vaswani et al, 2017) is almost exclusively used in supervised settings (Tran et al, 2021;Germann et al, 2021;Oravecz et al, 2020). For the Hungarian-English language-pair, published methods followed the same evolution of rule-based systems (Prószéky and Tihanyi, 2002), statistical methods (Laki et al, 2013) and neural models (Tihanyi and Oravecz, 2017).…”
Section: Related Workmentioning
confidence: 99%
“…Yet, for many low-resource translation tasks, English is often significantly distant from the target low-resource counterparts, such as Nepali and Sinhala, which in fact share certain similarities between themselves. Thus, English may not share similar structural patterns with any of the target languages [31]. Despite that, most existing UMT models are bidirectional [9,29,22].…”
Section: Second Stagementioning
confidence: 99%
“…Our proposed method aims to gradually disentangle the languages and focus on the target low-resource directions only. One prominent feature of our method is that it prioritises to disentangle English from the remaining low-resource languages first, as these languages are often much more distant from the common language English than from themselves [31].…”
Section: Introductionmentioning
confidence: 99%