Dynamic Past and Future for Neural Machine Translation

Zheng, Zaixiang; Huang, Shujian; Tu, Zhaopeng; Dai, Xinyu; Chen, Jiajun

doi:10.18653/v1/d19-1086

Cited by 31 publications

(17 citation statements)

References 24 publications

“…Translation Quality The results on the EN→DE, DE→EN and ZH→EN are shown in Table 1. For a fair comparison, we also report several Transformer baseline from previous work (Vaswani et al 2017;Zheng et al 2019;Dou et al 2018). Our Transformer baseline achieves similar or better results comparing with them.…”

Section: Results (mentioning)

confidence: 99%

Acquiring Knowledge from Pre-Trained Model to Neural Machine Translation

Weng

Huang

et al. 2020

AAAI

Self Cite

Pre-training and fine-tuning have achieved great success in natural language process field. The standard paradigm of exploiting them includes two steps: first, pre-training a model, e.g. BERT, with a large scale unlabeled monolingual data. Then, fine-tuning the pre-trained model with labeled data from downstream tasks. However, in neural machine translation (NMT), we address the problem that the training objective of the bilingual task is far different from the monolingual pre-trained model. This gap leads that only using fine-tuning in NMT can not fully utilize prior language knowledge. In this paper, we propose an Apt framework for acquiring knowledge from pre-trained model to NMT. The proposed approach includes two modules: 1). a dynamic fusion mechanism to fuse task-specific features adapted from general knowledge into NMT network, 2). a knowledge distillation paradigm to learn language knowledge continuously during the NMT training process. The proposed approach could integrate suitable knowledge from pre-trained models to improve the NMT. Experimental results on WMT English to German, German to English and Chinese to English machine translation tasks show that our model outperforms strong baselines and the fine-tuning counterparts.

“…Several Transformer systems with the same settings (Vaswani et al, 2017;Hassan et al, 2018;Gu et al, 2017) are reported as a comparison (line 1-6). Then, several related researches about improve faithfulness of NMT (Kong et al, 2019;Zheng et al, 2019;Feng et al, 2020) or exploiting translations for improving NMT (Xia et al, 2017;) also be reported (line 7-12). We implement three comparable approaches on our Transformer baseline, including: 1).…”

Section: Automatic Evaluation (mentioning)

confidence: 96%

“…proposed to model global representation in the source side to improve the source representation. Zheng et al (2019) proposed a capsule based module to control the source representation dynamically in the decoding process. ), Feng et al (2020 and Garg et al (2019) proposed to introduce word alignment information in Transformer to improve translation accuracy.…”

Section: Related Work (mentioning)

confidence: 99%

“…Several recent studies are proposed following one of the above perspectives and have achieved considerable effects. Zheng et al (2019) proposed to divide the encoder output into past and future parts to fine-grained modeling contextual representation. Feng et al (2020) proposed a faithfulness part to optimize the contextual representation before feeding into the decoder.…”

Section: Introduction (mentioning)

confidence: 99%

Towards Enhancing Faithfulness for Neural Machine Translation

Weng

Wei

et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Neural machine translation (NMT) has achieved great success due to the ability to generate high-quality sentences. Compared with human translations, one of the drawbacks of current NMT is that translations are not usually faithful to the input, e.g., omitting information or generating unrelated fragments, which inevitably decreases the overall quality, especially for human readers. In this paper, we propose a novel training strategy with a multi-task learning paradigm to build a faithfulness enhanced NMT model (named FENMT). During the NMT training process, we sample a subset from the training set and translate them to get fragments that have been mistranslated. Afterward, the proposed multi-task learning paradigm is employed on both encoder and decoder to guide NMT to correctly translate these fragments. Both automatic and human evaluations verify that our FENMT could improve translation quality by effectively reducing unfaithful translations.

“…Therefore, we construct a high quality annotated corpus (TransErr) comprising 15000 Chinese-English translation pairs with inter-annotator agreement at 0.804 measured by Cohen's Kappa (Cohen, 1960). Different from existing error detection works which focus on all error classes, we currently only take care of missing and wrong translation , the major errors related to adequacy, which is a wide-known issue in neural machine translation (NMT) (Zheng et al, 2019). The errors tags are annotated on source (Chinese) sentences to reflect the loyalty and adequacy with respect to the source.…”

Section: Introduction (mentioning)

confidence: 99%

Revisit Automatic Error Detection for Wrong and Missing Translation – A Supervised Approach

Lei

Xu

Aw

et al. 2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

While achieving great fluency, current machine translation (MT) techniques are bottlenecked by adequacy issues. To have a closer study of these issues and accelerate model development, we propose automatic detecting adequacy errors in MT hypothesis for MT model evaluation. To do that, we annotate missing and wrong translations, the two most prevalent issues for current neural machine translation model, in 15000 Chinese-English translation pairs. We build a supervised alignment model for translation error detection (AlignDet) based on a simple Alignment Triangle strategy to set the benchmark for automatic error detection task. We also discuss the difficulties of this task and the benefits of this task for existing evaluation metrics.
