Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021
DOI: 10.18653/v1/2021.emnlp-main.475
|View full text |Cite
|
Sign up to set email alerts
|

A Large-Scale Study of Machine Translation in Turkic Languages

Abstract: Recent advances in neural machine translation (NMT) have pushed the quality of machine translation systems to the point where they are becoming widely adopted for building competitive systems. However, there is still a large number of languages that are yet to reap the benefits of NMT. In this paper, we provide the first large-scale case study of the practical application of MT in the Turkic language family in order to realize the gains of NMT for Turkic languages under high-resource to extremely low-resource … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 48 publications
0
2
0
Order By: Relevance
“…For all languages except Uzbek, we use the WMT shared task data from He et al (2020). For Uzbek, we use the Turkic Interlingua corpus (Mirzakhalov et al, 2021).…”
Section: Methodsmentioning
confidence: 99%
“…For all languages except Uzbek, we use the WMT shared task data from He et al (2020). For Uzbek, we use the Turkic Interlingua corpus (Mirzakhalov et al, 2021).…”
Section: Methodsmentioning
confidence: 99%
“…The field of natural language processing (NLP) has become aware that most of the world's languages are unfortunately under-represented, or entirely absent, from the field's body of work (Joshi et al, 2020). In recent years, there has been a push in efforts to ameliorate this discrepancy (Nekoto et al, 2020;Mirzakhalov et al, 2021;Ogueji et al, 2021). Among these low-resourced languages 1 are Creole languages, which are particularly under-resourced due to barriers like societal stigma (Siegel, 1999), despite the fact that these languages are spoken by many people globally.…”
Section: Introductionmentioning
confidence: 99%