2016
DOI: 10.1162/coli_a_00245
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena

Abstract: Word reordering is one of the most difficult aspects of statistical machine translation (SMT), and an important factor of its quality and efficiency. Despite the vast amount of research published to date, the interest of the community in this problem has not decreased, and no single method appears to be strongly dominant across language pairs. Instead, the choice of the optimal approach for a new translation task still seems to be mostly driven by empirical trials.To orientate the reader in this vast and compl… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(15 citation statements)
references
References 82 publications
0
15
0
Order By: Relevance
“…In PBSMT, there has been a substantial amount of research works about reordering model, which was used as a key component to ensure the generation of fluent target translation. Bisazza and Federico (2016) divided these reordering models into four groups: Phrase orientation models (Tillman, 2004;Collins et al, 2005;Nagata et al, 2006;Zens and Ney, 2006;Galley and Manning, 2008;Cherry, 2013), simply known as lexicalized reordering models, predict whether the next translated source span should be placed on the right (monotone), the left (swap), or anywhere else (discontinuous) of the last translated one.…”
Section: Reordering Model For Pbsmtmentioning
confidence: 99%
“…In PBSMT, there has been a substantial amount of research works about reordering model, which was used as a key component to ensure the generation of fluent target translation. Bisazza and Federico (2016) divided these reordering models into four groups: Phrase orientation models (Tillman, 2004;Collins et al, 2005;Nagata et al, 2006;Zens and Ney, 2006;Galley and Manning, 2008;Cherry, 2013), simply known as lexicalized reordering models, predict whether the next translated source span should be placed on the right (monotone), the left (swap), or anywhere else (discontinuous) of the last translated one.…”
Section: Reordering Model For Pbsmtmentioning
confidence: 99%
“…Commonly, SMT systems are trained using reference translations by which ML algorithms are able to analyze the data and find patterns by themselves, thus being able to translate text without any rules created by humans. Although some basic linguistics mistakes have been solved by Tree-based and Neural Networkbased approaches, the lack of complex linguistic rules still causes ambiguity problems (e.g., errors on relative pronouns)- [149]. An additional problem of the latter approach is the complexity of performing error analysis over outputs.…”
Section: Mt Performance and Effortmentioning
confidence: 99%
“…2. Structural divergence: By definition, structural reordering is reorganizing the order of the syntactic constituents of a language according to its original structure [149]. It in turn becomes a critical issue because it is the core of the translation process.…”
Section: Open Mt Challengesmentioning
confidence: 99%
“…They may be deterministic (i.e., leading to a single reordered variant of the given source sentence) or non-deterministic (i.e., leading to several variants of the source sentence). An extensive overview of different preordering approaches is presented by Bisazza and Federico (2016). Xu et al (2009) and Nakagawa (2015) proposed preordering methods which can be applied to many different language pairs.…”
Section: Related Workmentioning
confidence: 99%
“…English and Japanese differ in many syntactic aspects: the order of the clauses is different, as well as the order of the words within the clauses. An extensive overview of the differences on various syntactic levels can be found in Bisazza and Federico (2016). The rule set for Japanese is taken from Lee et al (2010).…”
Section: Reordering Rulesmentioning
confidence: 99%