Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/P19-1116

When a Good Translation is Wrong in Context: Context-Aware Machine Translation Improves on Deixis, Ellipsis, and Lexical Cohesion

Abstract: Though machine translation errors caused by the lack of context beyond one sentence have long been acknowledged, the development of context-aware NMT systems is hampered by several problems. Firstly, standard metrics are not sensitive to improvements in consistency in document-level translations. Secondly, previous work on context-aware NMT assumed that the sentence-aligned parallel data consisted of complete documents, while in most practical scenarios such document-level data constitutes only a fraction of the available parallel data.

Cited by 150 publications (223 citation statements). References 22 publications.
“…Moreover, we notice that deixis scores are less sensitive to the amount of training data than lexical cohesion and ellipsis scores. The reason might be that, as we observed in our previous work (Voita et al., 2019), inconsistencies in translations due to the presence of deictic words and phrases are more frequent in this dataset than other types of inconsistencies. Also, as we show in Section 7, this is the phenomenon the model learns faster in training.…”
Section: Varying Training Data (mentioning)
confidence: 48%
“…As a second baseline, we use the two-pass CADec model (Voita et al., 2019). The first pass produces sentence-level translations.…”
Section: Models (mentioning)
confidence: 99%
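The two-pass structure referenced above can be illustrated with a short sketch: a context-agnostic model first drafts a translation for each sentence, and a context-aware decoder then rewrites each draft using neighbouring context. This is a minimal sketch only, in the spirit of CADec (Voita et al., 2019); the `sentence_model.translate` and `context_decoder.refine` interfaces and the three-sentence context window are hypothetical placeholders, not the authors' actual API.

```python
def translate_document(sentence_model, context_decoder, src_sentences):
    """Two-pass document translation: draft sentence by sentence, then refine."""
    # Pass 1: a context-agnostic model translates each sentence independently.
    drafts = [sentence_model.translate(s) for s in src_sentences]

    # Pass 2: a context-aware decoder rewrites each draft, conditioning on the
    # source sentence, the draft itself, and previously refined target context.
    refined = []
    for i, (src, draft) in enumerate(zip(src_sentences, drafts)):
        context = refined[max(0, i - 3):i]  # e.g. up to 3 preceding sentences
        refined.append(context_decoder.refine(src, draft, context))
    return refined
```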
“…In this section, we review the existing document-level approaches for NMT and describe our strategies to filter out uninteresting words in the context input. We illustrate with an example of including one previous source sentence as the document-level context, which can easily be generalized to other context inputs such as target hypotheses (Agrawal et al., 2018; Bawden et al., 2018; Voita et al., 2019) or decoder states (Tu et al., 2018; Maruf and Haffari, 2018; Miculicich et al., 2018).…”
Section: Introduction (mentioning)
confidence: 99%
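The simplest form of the context input mentioned above, including one previous source sentence, can be realized by concatenation in the spirit of Tiedemann and Scherrer (2017): prepend the predecessor with a separator token and feed the pair to an otherwise unmodified sentence-level system. The sketch below is illustrative only; the `<SEP>` symbol and the `build_context_inputs` helper are assumptions, not part of any cited system.

```python
def build_context_inputs(src_sentences, sep="<SEP>"):
    """Prepend each sentence's predecessor as document-level context."""
    inputs = []
    for i, sent in enumerate(src_sentences):
        prev = src_sentences[i - 1] if i > 0 else ""
        inputs.append(f"{prev} {sep} {sent}".strip())
    return inputs

# Example:
# build_context_inputs(["I saw it.", "It was red."])
# -> ["<SEP> I saw it.", "I saw it. <SEP> It was red."]
```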
“…The omission of pronouns occurs more frequently in spoken language than in written language. Recently, context-aware translation models have attracted attention from many researchers (Tiedemann and Scherrer, 2017; Voita et al., 2018, 2019) as a way to solve this kind of problem; however, there are almost no conversational parallel corpora with context information except the noisy OpenSubtitles corpus.…”
Section: Introduction (mentioning)
confidence: 99%