Proceedings of the Fourth Arabic Natural Language Processing Workshop 2019
DOI: 10.18653/v1/w19-4617
|View full text |Cite
|
Sign up to set email alerts
|

Translating Between Morphologically Rich Languages: An Arabic-to-Turkish Machine Translation System

Abstract: This paper introduces the work on building a machine translation system for Arabic-to-Turkish in the news domain. Our work includes collecting parallel datasets in several ways for a new and low-resource language pair, building baseline systems with state-ofthe-art architectures and developing language specific algorithms for better translation. Parallel datasets are mainly collected three different ways; i) translating Arabic texts into Turkish by professional translators, ii) exploiting the web for open-sour… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 14 publications
(10 reference statements)
0
1
0
Order By: Relevance
“…All numbers are based on the corpora as available from OPUS parallel corpora collection http://opus.nlpl.eu/. on MT between Turkish and mainly English (e.g., Durgar El-Kahlout and Oflazer, 2010;Oflazer et al, 2018;Durgar El-Kahlout et al, 2019). Similarly, shared tasks which included Turkish as one of the languages, such as two IWLST shared tasks (Paul et al, 2010;Cettolo et al, 2013), and WMT shared tasks between 2016 and 2018 (Bojar et al, 2016), also provided data for use during the shared tasks.…”
Section: Parallel Corporamentioning
confidence: 99%
“…All numbers are based on the corpora as available from OPUS parallel corpora collection http://opus.nlpl.eu/. on MT between Turkish and mainly English (e.g., Durgar El-Kahlout and Oflazer, 2010;Oflazer et al, 2018;Durgar El-Kahlout et al, 2019). Similarly, shared tasks which included Turkish as one of the languages, such as two IWLST shared tasks (Paul et al, 2010;Cettolo et al, 2013), and WMT shared tasks between 2016 and 2018 (Bojar et al, 2016), also provided data for use during the shared tasks.…”
Section: Parallel Corporamentioning
confidence: 99%