Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers on XX - NAACL '06 2006
DOI: 10.3115/1614049.1614100
|View full text |Cite
|
Sign up to set email alerts
|

Bridging the inflection morphology gap for Arabic statistical machine translation

Abstract: Statistical machine translation (SMT) is based on the ability to effectively learn word and phrase relationships from parallel corpora, a process which is considerably more difficult when the extent of morphological expression differs significant across the source and target languages. We present techniques that select appropriate word segmentations in the morphologically rich source language based on contextual relationships in the target language. Our results take advantage of existing word level morphologic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
22
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(23 citation statements)
references
References 10 publications
1
22
0
Order By: Relevance
“…They also demonstrated gains in MT quality through a combination of different preprocessing schemes. Additional similar results were reported using specific preprocessing schemes and techniques (Och 2005;El Isbihani et al 2006;Riesa and Yarowsky 2006;Zollmann et al 2006).…”
Section: Morphology-based Approachessupporting
confidence: 68%
“…They also demonstrated gains in MT quality through a combination of different preprocessing schemes. Additional similar results were reported using specific preprocessing schemes and techniques (Och 2005;El Isbihani et al 2006;Riesa and Yarowsky 2006;Zollmann et al 2006).…”
Section: Morphology-based Approachessupporting
confidence: 68%
“…Integrating the previous researches [1][2][3][4][5][6][7][8][9][10], a lot of literatures have reported label corpus automatic achieving study aiming at word sense disambiguation in a monolingual. Different from tasks of word sense disambiguation, Uyghur translation disambiguation is to solve the correct translation of Uyghur ambiguous terms in target language in nature, i.e., to find the most suitable Chinese translation for Uyghur ambiguous terms according to the context.…”
Section: Introductionmentioning
confidence: 99%
“…Lee [21] uses a morphologically analyzed and tagged parallel corpus for Arabic-English statistical machine translation. Zolmann et al [22] also exploit morphology in Arabic-English statistical machine translation. Popovic and Ney [23] investigate improving translation quality from inflected languages by using stems, suffixes and part-of-speech tags.…”
Section: Exploiting Morphologymentioning
confidence: 99%