Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions - ACL '07 2007
DOI: 10.3115/1557769.1557821
Moses: Open Source Toolkit for Statistical Machine Translation

Abstract: We describe an open-source toolkit for statistical machine translation whose novel contributions are (a) support for linguistically motivated factors, (b) confusion network decoding, and (c) efficient data formats for translation models and language models. In addition to the SMT decoder, the toolkit also includes a wide variety of tools for training, tuning and applying the system to many translation tasks.
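To picture the "linguistically motivated factors" mentioned in (a), the sketch below builds Moses-style factored tokens, where each surface word carries extra factors (here a lemma and a POS tag) joined with the '|' separator used in factored training data. The helper name, factor choice, and example sentence are illustrative assumptions, not part of the toolkit's API.

```python
# Minimal sketch of Moses-style factored tokens (surface|lemma|POS).
# Factor choice and example data are illustrative assumptions; Moses itself
# reads such factored text from preprocessed training corpora.

def to_factored(tokens):
    """Join the per-word factors with '|' as in Moses factored training data."""
    return " ".join("|".join(factors) for factors in tokens)

sentence = [
    ("houses", "house", "NNS"),
    ("are", "be", "VBP"),
    ("expensive", "expensive", "JJ"),
]

print(to_factored(sentence))
# houses|house|NNS are|be|VBP expensive|expensive|JJ
```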

Cited by 2,296 publications (221 citation statements). References 9 publications.
“…The pairs were aligned using GIZA++ and the phrase extractor and scorer from the Moses machine translation package (Koehn et al., 2007). To apply a machine translation analogy, we treated words as sentences and the letters from which they were constructed as tokens.…”
Section: Arabizi to Arabic
confidence: 99%
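As a rough illustration of the analogy in that snippet (words treated as sentences, letters as tokens), the hedged sketch below formats transliteration pairs as space-separated character sequences, the form a GIZA++/Moses phrase-based pipeline would consume. The word pairs and output file names are hypothetical placeholders.

```python
# Hedged sketch: prepare word pairs for character-level phrase alignment,
# treating each word as a "sentence" and each letter as a "token".
# The pair list and output paths are hypothetical placeholders.

pairs = [("3arabi", "عربي"), ("salam", "سلام")]

with open("train.arabizi", "w", encoding="utf-8") as src, \
     open("train.arabic", "w", encoding="utf-8") as tgt:
    for arabizi, arabic in pairs:
        src.write(" ".join(arabizi) + "\n")   # "3 a r a b i"
        tgt.write(" ".join(arabic) + "\n")    # "ع ر ب ي"
```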
“…We use mteval from the Moses toolkit (Koehn et al., 2007) and TERCom to evaluate our systems on the BLEU (Papineni et al., 2002) and TER (Snover et al., 2006) measures. Additionally, we use BEER (Stanojević and Sima'an, 2014) and CTER (Wang et al., 2016).…”
Section: SMT Systems
confidence: 99%
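For readers reproducing this kind of evaluation without the Moses mteval script or TERCom, a hedged stand-in is the sacrebleu package, which computes corpus-level BLEU from plain detokenized text; the hypothesis and reference strings below are invented examples, and TER/BEER/CTER are not covered by this sketch.

```python
# Hedged sketch: corpus-level BLEU with sacrebleu as a stand-in for the
# Moses mteval script.  Hypotheses and references are toy examples.
import sacrebleu

hypotheses = ["the cat sat on the mat", "he read the book"]
references = [["the cat is on the mat", "he was reading a book"]]  # one reference stream

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(f"BLEU = {bleu.score:.2f}")
```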
“…We used beam search with a beam width of 8 to approximately find the most likely translations given a source sentence, before introducing features proposed by our language models and reranking with the default Moses (Koehn et al., 2007) implementation of K-best MIRA (Cherry and Foster, 2012). Both language models were trained on the English news data.…”
Section: Neural Baseline
confidence: 99%
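The reranking step in that snippet can be pictured with the hedged sketch below: it rescores an n-best list by combining per-hypothesis feature scores with a fixed weight vector and keeps the best candidate. The feature names, weights, and scores are invented for illustration; in the cited setup the weights would be tuned with Moses' K-best MIRA, which is not reproduced here.

```python
# Hedged sketch of n-best reranking: combine per-hypothesis feature scores
# with fixed weights and keep the highest-scoring candidate.
# Features, weights, and scores are invented for illustration.

weights = {"decoder": 1.0, "lm": 0.5}

nbest = [
    {"text": "the house is big",   "decoder": -2.1, "lm": -4.0},
    {"text": "the house is large", "decoder": -2.3, "lm": -3.2},
]

def rerank(hypotheses, weights):
    return max(hypotheses,
               key=lambda h: sum(weights[f] * h[f] for f in weights))

print(rerank(nbest, weights)["text"])
```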
“…Additionally, we backtranslated a subset of these sentences and used the resulting source-target sentences to augment our training data. Our training and development data were lowercased and preprocessed using the Moses tokenizer script (Koehn et al., 2007), Jieba, and BPE. We set the upper bound on the target vocabulary to 30,000 sub-words, with two additional tokens reserved for EOS and UNK.…”
Section: Corpora and Preprocessing
confidence: 99%
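A hedged sketch of that preprocessing follows, using the sacremoses Python port of the Moses tokenizer for lowercasing and tokenization and a simple frequency cutoff with reserved EOS/UNK entries standing in for the BPE vocabulary cap. The toy corpus and the way the 30,000-entry limit is applied are illustrative assumptions; Jieba segmentation and actual BPE learning are omitted.

```python
# Hedged preprocessing sketch: lowercase, tokenize with the sacremoses port
# of the Moses tokenizer, and build a capped vocabulary with reserved
# EOS/UNK entries.  Corpus and cap handling are illustrative assumptions.
from collections import Counter
from sacremoses import MosesTokenizer

tokenizer = MosesTokenizer(lang="en")
corpus = ["The quick brown fox.", "The lazy dog sleeps."]

tokenized = [tokenizer.tokenize(line.lower()) for line in corpus]

counts = Counter(tok for sent in tokenized for tok in sent)
VOCAB_SIZE = 30_000  # upper bound from the cited setup
vocab = ["<eos>", "<unk>"] + [w for w, _ in counts.most_common(VOCAB_SIZE)]

print(tokenized[0])
print(vocab[:10])
```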