IEEE Workshop on Automatic Speech Recognition and Understanding, 2005. 2005
DOI: 10.1109/asru.2005.1566493
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging multiple languages to improve statistical MT word alignments

Abstract: We present a new multilingual statistical MT word alignment model based on a simple extension of the IBM and HMM Models and a two-step alignment procedure. Preliminary results on a small hand-aligned subset of the Europarl corpus show a 7% relative improvement over a state of the art alignment model.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2005
2005
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 12 publications
0
5
0
Order By: Relevance
“…Kumar et al (2007) show that multiparallel corpora can be of benefit to reach better performance in phrase-based statistical machine translation (SMT). Filali and Bilmes (2005) present a multilingual SMT-based word alignment model, extending IBM models, based on HMM models and a two step alignment procedure. Since the goal of this research is to tackle word alignment directly without considering machine translation, these works are not considered here.…”
Section: Related Workmentioning
confidence: 99%
“…Kumar et al (2007) show that multiparallel corpora can be of benefit to reach better performance in phrase-based statistical machine translation (SMT). Filali and Bilmes (2005) present a multilingual SMT-based word alignment model, extending IBM models, based on HMM models and a two step alignment procedure. Since the goal of this research is to tackle word alignment directly without considering machine translation, these works are not considered here.…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, it has been argued that English might not always be the optimal choice of a pivot language (Paul, Yamamoto, Sumita, & Nakamura, 2009). Finally, pivoting techniques have been also used at the word-level, e.g., for translation lexicon induction between Japanese and German using English (Tanaka, Murakami, & Ishida, 2009), or for improving word alignments (Filali & Bilmes, 2005;Kumar, Och, & Macherey, 2007). Pivot languages have also been used for lexical adaptation (Crego, Max, & Yvon, 2010).…”
Section: The Egyptian Wikipedia Is One Notable Exceptionmentioning
confidence: 99%
“…Some authors have studied how multilingual parallel corpora can be used to improve bitext alignment. Filali and Bilmes (2005) use (bitext) alignments to addditional languages as features in bitext alignment, while Kumar et al (2007) interpolate alignments through multiple bridge languages to produce a bitext alignment for another language pair. Since the goal of this research is not multilingual alignment, it will not be considered further here.…”
Section: Introductionmentioning
confidence: 99%