Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence 2017
DOI: 10.24963/ijcai.2017/432

Sequence Prediction with Unlabeled Data by Reward Function Learning

Abstract: Reinforcement learning (RL), which has been successfully applied to sequence prediction, introduces reward as a sequence-level supervision signal to evaluate the quality of a generated sequence. Existing RL approaches use the ground-truth sequence to define the reward, which limits the application of RL techniques to labeled data. Since labeled data is usually scarce and/or costly to collect, it is desirable to leverage large-scale unlabeled data. In this paper, we extend existing RL methods for sequence prediction …
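The abstract only sketches the approach, so the following is a rough illustration of the general idea rather than the paper's actual algorithm: a REINFORCE-style update in which the sequence-level reward comes from a learned reward model instead of a metric computed against a ground-truth reference, so that unlabeled source-side data can also provide training signal. All names (TinySeqPolicy, reward_model) and the toy reward are hypothetical.

```python
# Hypothetical sketch (not the paper's exact method): policy-gradient training of a
# sequence generator where the reward is produced by a learned reward model r(x, y)
# rather than by comparing the output to a ground-truth reference.
import torch
import torch.nn as nn

class TinySeqPolicy(nn.Module):
    """Toy autoregressive policy over a small vocabulary (stand-in for an NMT decoder)."""
    def __init__(self, vocab=20, hidden=32):
        super().__init__()
        self.embed = nn.Embedding(vocab, hidden)
        self.rnn = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab)

    def sample(self, max_len=5):
        tok = torch.zeros(1, 1, dtype=torch.long)        # <bos> token
        h, log_probs, toks = None, [], []
        for _ in range(max_len):
            o, h = self.rnn(self.embed(tok), h)
            dist = torch.distributions.Categorical(logits=self.out(o[:, -1]))
            sampled = dist.sample()                       # next token, sampled from the policy
            log_probs.append(dist.log_prob(sampled))
            tok = sampled.unsqueeze(0)
            toks.append(sampled.item())
        return toks, torch.stack(log_probs).sum()

def reward_model(source_tokens, sampled_tokens):
    # Placeholder for a learned reward model scoring (source, output) pairs;
    # here it is just a toy diversity score so the sketch runs end to end.
    return float(len(set(sampled_tokens))) / len(sampled_tokens)

policy = TinySeqPolicy()
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)

unlabeled_sources = [[1, 2, 3], [4, 5, 6]]                # source sentences with no references
for x in unlabeled_sources:
    y, log_p = policy.sample()
    r = reward_model(x, y)                                # learned reward, not BLEU vs. a reference
    loss = -r * log_p                                     # REINFORCE: maximize expected reward
    opt.zero_grad()
    loss.backward()
    opt.step()
```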

Cited by 11 publications (6 citation statements) | References 9 publications
“…Zhang and Zong (2016) propose a self-learning approach to generate synthetic data from the source-side monolingual data, which is a semi-supervised method. Wu et al. (2017) leverage the source-side monolingual data to train the NMT system by learning a reward function in a reinforcement learning framework.…”
Section: Improving NMT by Monolingual Data
confidence: 99%
“…To address the inconsistency issue, reinforcement learning (RL) methods have been adopted to optimize sequence-level objectives. For example, policy optimization methods such as REINFORCE (Ranzato et al., 2016; Wu et al., 2017b) and actor-critic (Bahdanau et al., 2017) are leveraged for sequence generation tasks, including NMT. In the machine translation community, a similar method has been proposed under the name ‘minimum risk training’ (Shen et al., 2016).…”
Section: Introduction
confidence: 99%
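The sequence-level objective that these policy-gradient methods optimize can be written as an expected reward over sampled outputs. The notation below is generic (mine, not taken verbatim from the cited papers): x is a source sentence, y a sampled output, and R(y) a sequence-level reward such as sentence BLEU or, in this paper's setting, a learned reward.

```latex
% Expected-reward objective and its REINFORCE gradient estimator.
\mathcal{L}(\theta) = \mathbb{E}_{y \sim p_\theta(\cdot \mid x)}\!\left[ R(y) \right],
\qquad
\nabla_\theta \mathcal{L}(\theta)
  = \mathbb{E}_{y \sim p_\theta(\cdot \mid x)}\!\left[ R(y)\, \nabla_\theta \log p_\theta(y \mid x) \right].
```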
“…Even though RL-based models are difficult to train, in recent years multiple works (Mnih et al., 2014; Choi et al., 2017; Yu et al., 2017; Narayan et al., 2018; Sathish et al., 2018; Shen et al., 2018) have been shown to improve the performance of several natural language processing tasks. RL has also been used in NMT (Wu et al., 2017; Bahdanau et al., 2017) to overcome the inconsistency between the token-level objective function and sequence-level evaluation metrics such as BLEU. Our approach is also related to the method proposed by Lei et al. (2016) to explain the decisions of a text classifier.…”
Section: Related Work
confidence: 99%