Data augmentation using back-translation for context-aware neural machine translation

Sugiyama, Amane; Yoshinaga, Naoki

doi:10.18653/v1/d19-6504

Cited by 63 publications

(49 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Data augmentation is a spotlight in recent years, from a limited training data will automatically generate more training data as considered semi-supervised learning. Sennrich et al [12], Sugiyama and Yoshinaga [13] used back translation technique to generate training data to improve performance of translation model. Fadaee at al.…”

Section: Related Workmentioning

confidence: 99%

“…The biggest disadvantage of these methods is not reserving meaning concerning the context of the sentences, so we present more complex approaches retaining the meaning as the original sentence. Back translation aims to obtain more training samples based on the translators, many research teams have used to improve translation models [12][13][14][15]23]. This technique is resolved by using the translators to translate the original data to a certain language, after that taking the translated data into the independent translator to translate back to the original language.…”

Section: Data Augmentationmentioning

confidence: 99%

See 1 more Smart Citation

A review: preprocessing techniques and data augmentation for sentiment analysis

Duong

Nguyen-Thi

2021

Comput Soc Netw

View full text Add to dashboard Cite

In literature, the machine learning-based studies of sentiment analysis are usually supervised learning which must have pre-labeled datasets to be large enough in certain domains. Obviously, this task is tedious, expensive and time-consuming to build, and hard to handle unseen data. This paper has approached semi-supervised learning for Vietnamese sentiment analysis which has limited datasets. We have summarized many preprocessing techniques which were performed to clean and normalize data, negation handling, intensification handling to improve the performances. Moreover, data augmentation techniques, which generate new data from the original data to enrich training data without user intervention, have also been presented. In experiments, we have performed various aspects and obtained competitive results which may motivate the next propositions.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Data Augmentationmentioning

confidence: 99%

A review: preprocessing techniques and data augmentation for sentiment analysis

Duong

Nguyen-Thi

2021

Comput Soc Netw

View full text Add to dashboard Cite

show abstract

“…One of the techniques to get pseudo parallel corpora for context-aware NMT models is data augmentation using back-translation [17]. So, taking this approach, we assume that sentence simplification can be partially solved with the back-translation technique without fine-tuning to a downstream task or training a new model.…”

Section: Sentence Simplification Through Back-translationmentioning

confidence: 99%

ruBTS: Russian Sentence Simplification Using Back-translation

Galeev¹,

Leushina²,

Ivanov³

2021

Computational Linguistics and Intellectual Technologies

View full text Add to dashboard Cite

Automatic text simplification is a crucial task enabling to reduce text complexity while preserving meaning. This paper presents our solution to the Russian Sentence Simplification Shared Task (RSSE) based on a backtranslation technique. We show that applying the simple back-translation approach for sentence simplification can give competitive results with the other methods without fine-tuning or training.

show abstract

“…In computer vision, data augmentation technologies are widely applied to generate auxiliary training examples [20][21][22]. In NLP, back-translation has been proven to be effective in augmenting diverse instances [23][24][25]. We borrow this idea and translate training sentences into pivot languages.…”

Section: Plos Onementioning

confidence: 99%

Untitled

View full text Add to dashboard Cite

Data augmentation using back-translation for context-aware neural machine translation

Cited by 63 publications

References 17 publications

A review: preprocessing techniques and data augmentation for sentiment analysis

A review: preprocessing techniques and data augmentation for sentiment analysis

ruBTS: Russian Sentence Simplification Using Back-translation

Untitled

Contact Info

Product

Resources

About