Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers 2016
DOI: 10.18653/v1/w16-2322
|View full text |Cite
|
Sign up to set email alerts
|

Abu-MaTran at WMT 2016 Translation Task: Deep Learning, Morphological Segmentation and Tuning on Character Sequences

Abstract: This paper presents the systems submitted by the Abu-MaTran project to the Englishto-Finnish language pair at the WMT 2016 news translation task. We applied morphological segmentation and deep learning in order to address (i) the data scarcity problem caused by the lack of in-domain parallel data in the constrained task and (ii) the complex morphology of Finnish. We submitted a neural machine translation system, a statistical machine translation system reranked with a neural language model and the combination … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 14 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…Neural MT: NMT is a new approach to statistical MT, which, while having been introduced only very recently, 2 has already shown great potential, as there is evidence that it can attain better translation quality than the dominant approach to date, namely phrase-based statistical MT (PBSMT). This has been shown for a number of language pairs and domains, including transcribed speeches (Luong and Manning, 2015), newswire (Sánchez-Cartagena and Toral, 2016) and United Nations documents (Junczys-Dowmunt et al, 2016). Beyond its generally positive performance, NMT is of particular interest for literary texts due to the following two findings:…”
mentioning
confidence: 86%
“…Neural MT: NMT is a new approach to statistical MT, which, while having been introduced only very recently, 2 has already shown great potential, as there is evidence that it can attain better translation quality than the dominant approach to date, namely phrase-based statistical MT (PBSMT). This has been shown for a number of language pairs and domains, including transcribed speeches (Luong and Manning, 2015), newswire (Sánchez-Cartagena and Toral, 2016) and United Nations documents (Junczys-Dowmunt et al, 2016). Beyond its generally positive performance, NMT is of particular interest for literary texts due to the following two findings:…”
mentioning
confidence: 86%
“…Bradbury and Socher (Bradbury et al, 2014) employed the modified Morfessor to provide morphology knowledge into word segmentation, but they neglected the morphological varieties between subword units, which might result in am- and Toral (Sánchez-Cartagena et al, 2016) proposed a rule-based morphological word segmentation for Finnish, which applies BPE on all the morpheme units uniformly without distinguishing their inner morphological roles. Huck (Huck et al, 2017) explored target-side segmentation method for German, which shows that the cascading of suffix splitting and compound splitting with BPE can achieve better translation results.…”
Section: Related Workmentioning
confidence: 99%