LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification

Xu, Jingjing; Zhao, Liang; Yan, Hanqi; Zeng, Qi; Liang, Yun; Sun, Xu

doi:10.18653/v1/d19-1554

Cited by 17 publications

(14 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Synonymous sample generation aimed to randomly replace some words in the real samples with their synonyms, hypernyms, or hyponyms from WordNet to generate a large amount of synonymous samples (Zhang et al, 2015;Kobayashi, 2018;Xu et al, 2019). However, these methods tend to suffer from the spurious association problem.…”

Section: Related Workmentioning

confidence: 99%

“…However, these methods tend to suffer from the spurious association problem. It is worth noting that our model is similar to Xu et al (2019), but there are a number of major differences. Firstly, it focused on generating synonymous samples with the same sentiment label, while our work aims to generate antonymous samples with the reversed sentiment label; Secondly, our discriminator contains an original-side predictor and an antonymous-side predictor which are paired for dual sentiment classification, and alleviate the spurious association problem.…”

Section: Related Workmentioning

confidence: 99%

“…The word substitution-based methods have been shown to be effective and stable in synonymous sentence generation. Inspired by Xu et al (2019), we propose to generate antonymous sentences based on word substitution. Specifically, we define three word substitution rules for each word in the sentence: no replacement, replacing with an antonym, and replacing with a synonym.…”

Section: Antonymous Sentence Generatormentioning

confidence: 99%

“…In policy gradient-based methods, it is a common practice to subtract a baseline reward from the current reward. The goal of the baseline reward r b is to enforce the generator to select x that yields a reward In contrast to Xu et al (2019) that only sampled one synonymous sentence for each sentence and defined r b as the expectation of the reward of all sampling sentences, we sample M antonymous sentences for each sentence, and use the average value of these M antonymous sentences as the baseline reward…”

Section: Reinforcement Trainingmentioning

confidence: 99%

“…Researchers have attempted to address this issue from two main perspectives: data augmentation and adversarial perturbation. The former tries to augment the training data by generating synonymous sentences (Zhang et al, 2015;Kobayashi, 2018;Xu et al, 2019); the latter aims to improve the generalization ability by applying perturbations to the word embeddings (Miyato et al, 2017;Croce et al, 2020). Although these methods have achieved sound performance, they still suffer from the spurious association problem.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Reinforced Counterfactual Data Augmentation for Dual Sentiment Classification

Chen¹,

Xia²,

Yu³

2021

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

Data augmentation and adversarial perturbation approaches have recently achieved promising results in solving the over-fitting problem in many natural language processing (NLP) tasks including sentiment classification. However, existing studies aimed to improve the generalization ability by augmenting the training data with synonymous examples or adding random noises to word embeddings, which cannot address the spurious association problem. In this work, we propose an end-toend reinforcement learning framework, which jointly performs counterfactual data generation and dual sentiment classification. Our approach has three characteristics: 1) the generator automatically generates massive and diverse antonymous sentences; 2) the discriminator contains a original-side sentiment predictor and an antonymous-side sentiment predictor, which jointly evaluate the quality of the generated sample and help the generator iteratively generate higher-quality antonymous samples; 3) the discriminator is directly used as the final sentiment classifier without the need to build an extra one. Extensive experiments show that our approach outperforms strong data augmentation baselines on several benchmark sentiment classification datasets. Further analysis confirms our approach's advantages in generating more diverse training samples and solving the spurious association problem in sentiment classification.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Antonymous Sentence Generatormentioning

confidence: 99%

Section: Reinforcement Trainingmentioning

confidence: 99%