2013
DOI: 10.1162/tacl_a_00239
|View full text |Cite
|
Sign up to set email alerts
|

Measuring Machine Translation Errors in New Domains

Abstract: We develop two techniques for analyzing the effect of porting a machine translation system to a new domain. One is a macro-level analysis that measures how domain shift affects corpus-level evaluation; the second is a micro-level analysis for word-level errors. We apply these methods to understand what happens when a Parliament-trained phrase-based machine translation system is applied in four very different domains: news, medical texts, scientific articles and movie subtitles. We present quantitative and qual… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 54 publications
(39 citation statements)
references
References 9 publications
0
39
0
Order By: Relevance
“…In light of these apparent success, we examine the failure modes of existing models for morphological generation. We first propose and motivate an error taxonomy for this task, inspired by similar proposals for other natural language generation and processing technologies such as grammatical error correction (e.g., Rozovskaya and Roth 2016) and machine translation (e.g., Popović and Ney 2011, Fishel et al 2012, Irvine et al 2013. We then use this taxonomy to perform a manual error analysis of the CoNLL-SIGMORPHON 2017 Shared Task.…”
Section: Introductionmentioning
confidence: 99%
“…In light of these apparent success, we examine the failure modes of existing models for morphological generation. We first propose and motivate an error taxonomy for this task, inspired by similar proposals for other natural language generation and processing technologies such as grammatical error correction (e.g., Rozovskaya and Roth 2016) and machine translation (e.g., Popović and Ney 2011, Fishel et al 2012, Irvine et al 2013. We then use this taxonomy to perform a manual error analysis of the CoNLL-SIGMORPHON 2017 Shared Task.…”
Section: Introductionmentioning
confidence: 99%
“…In the real scenario, training data and test data have different distributions and the target domains are sometimes unseen. Irvine et al (2013) analyze the translation errors in such scenarios. Domain generalization aims to apply knowledge gained from labeled source domains to unseen target domains (Li et al, 2018).…”
Section: Adversarial Domain Adaptation and Domain Generationmentioning
confidence: 99%
“…There are some studies in the area of SMT evaluation, e.g. those dealing with the errors in translation of new domains (Irvine et al, 2013). However, the error types concern the lexical level only, as the authors operate solely with the notion of domain (field of discourse) and not register (which includes more parameters, see Section 2.1 above).…”
Section: Register In Translationmentioning
confidence: 99%