Proceedings of the Third Workshop on Statistical Machine Translation - StatMT '08 2008
DOI: 10.3115/1626394.1626404
|View full text |Cite
|
Sign up to set email alerts
|

LIMSI's statistical translation systems for WMT'08

Abstract: This paper describes our statistical machine translation systems based on the Moses toolkit for the WMT08 shared task. We address the Europarl and News conditions for the following language pairs: English with French, German and Spanish. For Europarl, n-best rescoring is performed using an enhanced n-gram or a neuronal language model; for the News condition, language models incorporate extra training data. We also report unconvincing results of experiments with factored models.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2009
2009
2017
2017

Publication Types

Select...
6
3

Relationship

5
4

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 8 publications
0
10
0
Order By: Relevance
“…The pre-processing of English data relies on inhouse tools (Déchelotte et al, 2008). All the Czech data were tokenized and truecased using the Moses toolkit (Koehn et al, 2007).…”
Section: Data and Preprocessingmentioning
confidence: 99%
“…The pre-processing of English data relies on inhouse tools (Déchelotte et al, 2008). All the Czech data were tokenized and truecased using the Moses toolkit (Koehn et al, 2007).…”
Section: Data and Preprocessingmentioning
confidence: 99%
“…Tokenization for English text relies on in-house text processing tools (Déchelotte et al, 2008). For the Russian corpora, we used the TreeTagger tokenizer.…”
Section: Data Pre-processing and Word Alignmentsmentioning
confidence: 99%
“…The preprocessing of English data relies on in-house tools (Déchelotte et al, 2008). All the Czech data were tokenized and truecased the Moses toolkit (Koehn et al, 2007).…”
Section: Data and Preprocessingmentioning
confidence: 99%