A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena

Bisazza, Arianna; Federico, Marcello

doi:10.1162/coli_a_00245

Cited by 26 publications

(15 citation statements)

References 82 publications


“…In PBSMT, there has been a substantial amount of research works about reordering model, which was used as a key component to ensure the generation of fluent target translation. Bisazza and Federico (2016) divided these reordering models into four groups: Phrase orientation models (Tillman, 2004; Collins et al, 2005; Nagata et al, 2006; Zens and Ney, 2006; Galley and Manning, 2008; Cherry, 2013), simply known as lexicalized reordering models, predict whether the next translated source span should be placed on the right (monotone), the left (swap), or anywhere else (discontinuous) of the last translated one.…”

Section: Reordering Model for PBSMT (mentioning)

confidence: 99%

Neural Machine Translation with Reordering Embeddings

Chen, Wang, Utiyama et al. 2019

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics


The reordering model plays an important role in phrase-based statistical machine translation. However, there are few works that exploit the reordering information in neural machine translation. In this paper, we propose a reordering mechanism to learn the reordering embedding of a word based on its contextual information. These reordering embeddings are stacked together with self-attention networks to learn sentence representation for machine translation. The reordering mechanism can be easily integrated into both the encoder and the decoder in the Transformer translation system. Experimental results on WMT'14 English-to-German, NIST Chinese-to-English, and WAT ASPEC Japanese-to-English translation tasks demonstrate that the proposed methods can significantly improve the performance of the Transformer translation system.


“…Commonly, SMT systems are trained using reference translations by which ML algorithms are able to analyze the data and find patterns by themselves, thus being able to translate text without any rules created by humans. Although some basic linguistics mistakes have been solved by Tree-based and Neural Network-based approaches, the lack of complex linguistic rules still causes ambiguity problems (e.g., errors on relative pronouns) [149]. An additional problem of the latter approach is the complexity of performing error analysis over outputs.…”

Section: MT Performance and Effort (mentioning)

confidence: 99%

“…2. Structural divergence: By definition, structural reordering is reorganizing the order of the syntactic constituents of a language according to its original structure [149]. It in turn becomes a critical issue because it is the core of the translation process.…”

Section: Open MT Challenges (mentioning)

confidence: 99%

Machine Translation using Semantic Web Technologies: A Survey

Moussallem, Wauer, Ngomo 2018

Journal of Web Semantics


A large number of machine translation approaches have recently been developed to facilitate the fluid migration of content across languages. However, the literature suggests that many obstacles must still be dealt with to achieve better automatic translations. One of these obstacles is lexical and syntactic ambiguity. A promising way of overcoming this problem is using Semantic Web technologies. This article presents the results of a systematic review of machine translation approaches that rely on Semantic Web technologies for translating texts. Overall, our survey suggests that while Semantic Web technologies can enhance the quality of machine translation outputs for various problems, the combination of both is still in its infancy.


“…They may be deterministic (i.e., leading to a single reordered variant of the given source sentence) or non-deterministic (i.e., leading to several variants of the source sentence). An extensive overview of different preordering approaches is presented by Bisazza and Federico (2016). Xu et al (2009) and Nakagawa (2015) proposed preordering methods which can be applied to many different language pairs.…”

Section: Related Work (mentioning)

confidence: 99%

“…English and Japanese differ in many syntactic aspects: the order of the clauses is different, as well as the order of the words within the clauses. An extensive overview of the differences on various syntactic levels can be found in Bisazza and Federico (2016). The rule set for Japanese is taken from Lee et al (2010).…”

Section: Reordering Rules (mentioning)

confidence: 99%

Integration of a Multilingual Preordering Component into a Commercial SMT Platform

Ramm, Superbo, Shterionov et al. 2017

The Prague Bulletin of Mathematical Linguistics


We present a multilingual preordering component tailored for a commercial Statistical Machine Translation platform. In commercial settings, issues such as processing speed as well as the ability to adapt models to the customers’ needs play a significant role and have a big impact on the choice of approaches that are added to the custom pipeline to deal with specific problems such as long-range reorderings. We developed a fast and customisable preordering component, also available as an open-source tool, which comes along with a generic implementation that is restricted neither to the translation platform nor to the Machine Translation paradigm. We test preordering on three language pairs: English → Japanese/German/Chinese for both Statistical Machine Translation (SMT) and Neural Machine Translation (NMT). Our experiments confirm previously reported improvements in the SMT output when the models are trained on preordered data, but they also show that preordering does not improve NMT.
