2022
DOI: 10.1609/aaai.v36i10.21362

Improved Text Classification via Contrastive Adversarial Training

Abstract: We propose a simple and general method to regularize the fine-tuning of Transformer-based encoders for text classification tasks. Specifically, during fine-tuning we generate adversarial examples by perturbing the word embedding matrix of the model and perform contrastive learning on clean and adversarial examples in order to teach the model to learn noise-invariant representations. By training on both clean and adversarial examples along with the additional contrastive objective, we observe consistent improve…
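
To make the training step described in the abstract concrete, below is a minimal sketch, assuming a PyTorch setup in which `encoder(inputs_embeds=..., attention_mask=...)` returns a pooled sentence representation and `classifier` is a linear head; the names `epsilon`, `tau`, and `lambda_ctr` are illustrative, not the paper's. This is not the authors' implementation: the adversarial example comes from a single FGSM-style step on the embedding output, and the contrastive term is an InfoNCE-style loss that pairs each clean example with its adversarial view.

```python
# Sketch only: single-step adversarial perturbation of the word embeddings plus a
# contrastive loss between clean and adversarial representations (assumptions above).
import torch
import torch.nn.functional as F

def contrastive_adversarial_loss(encoder, classifier, embedding_layer,
                                 input_ids, attention_mask, labels,
                                 epsilon=1.0, tau=0.1, lambda_ctr=1.0):
    # Clean forward pass, keeping the embedding output in the graph so we can
    # take its gradient.
    clean_emb = embedding_layer(input_ids)                                      # [B, T, H]
    h_clean = encoder(inputs_embeds=clean_emb, attention_mask=attention_mask)   # [B, H]
    ce_clean = F.cross_entropy(classifier(h_clean), labels)

    # FGSM-style perturbation: move the embeddings along the loss gradient.
    grad, = torch.autograd.grad(ce_clean, clean_emb, retain_graph=True)
    delta = epsilon * grad / (grad.norm(dim=-1, keepdim=True) + 1e-12)
    adv_emb = clean_emb + delta                              # delta carries no gradient

    # Forward pass on the adversarial example.
    h_adv = encoder(inputs_embeds=adv_emb, attention_mask=attention_mask)
    ce_adv = F.cross_entropy(classifier(h_adv), labels)

    # InfoNCE-style contrastive term: each clean representation should be closest
    # to its own adversarial view within the batch.
    z_clean = F.normalize(h_clean, dim=-1)
    z_adv = F.normalize(h_adv, dim=-1)
    logits = z_clean @ z_adv.t() / tau                       # [B, B] similarity matrix
    targets = torch.arange(z_clean.size(0), device=logits.device)
    contrastive = F.cross_entropy(logits, targets)

    # Train on clean + adversarial examples with the added contrastive objective.
    return ce_clean + ce_adv + lambda_ctr * contrastive
```

In such a setup the returned loss would be backpropagated once per batch, updating the encoder, classifier, and embedding weights together.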

Cited by 45 publications (14 citation statements). References 40 publications.
“…A novel technique for regularizing the fine-tuning of Transformer-based encoders for text classification problems is provided in Ref. 18. The model’s word embedding matrix is perturbed to generate adversarial examples, and contrastive learning is performed on clean and adversarial examples to teach the model noise-invariant representations.…”
Section: Literature Survey (mentioning)
confidence: 99%
“…AraBERTv2 achieved the highest accuracy, precision, and F1-score of 97% on the description dataset among all other transformer architectures; it is restricted to Arabic news content only.
Ref. 16 (2022). Model: pre-trained BERT model. Objective: detect fake news with a region-based distributed approach. Dataset: FakeNewsNet. Metrics: precision, recall, and F1-score. Result: the model achieved an accuracy of 91%. Limitation: the distributed framework still needs to be optimized for the mobile crowdsensing environment.
Ref. 17 (2021). Model: a hybrid model combining an RNN with bidirectional GRUs and an SVM. Objective: identify real and fake news. Dataset: FakeNewsNet. Metrics: accuracy, precision, recall, and F1-score. Result: the suggested methodology performed better than cutting-edge techniques. Limitation: SVM performance depends on the size of the feature vector; here the minimum feature-vector size was restricted to 512 units, the output of the GRUs.
Ref. 18 (2022). Method: perturbation of the word embedding matrix and contrastive learning with Transformers such as BERT and RoBERTa. Objective: regularizing Transformer-based encoders for text classification problems. Datasets: GLUE benchmark tasks and three intent classification datasets. Metric: accuracy. Results: an improvement of 1.7% on average over BERT-Large and 1.3% over RoBERTa-Large; on intent classification tasks, the fine-tuned RoBERTa-Large outperforms the RoBERTa-Large baseline by 1% on the entire test sets and 2% on the more challenging test sets. Limitation: modest perturbations to input vector entries may not be appropriate for sparse, high-dimensional inputs.
Ref. 19 (2022). Method: CRAL (consistent regularization for adaptation learning) and VAT (virtual adversarial training) with entropy minimization. Objective: adversarial training for specific domain adaptation. Datasets: two MDTC (multi-domain text classification) benchmarks. Metric: accuracy. Results: 88% and 90% on the two benchmarks. Limitation: accuracy is compromised in an unseen domain.…”
Section: Literature Survey (mentioning)
confidence: 99%
“…In particular, with the emergence of models such as BERT [18], XLNet [34], RoBERTa [13], T5 [15], and ELECTRA [35], research has been conducted on applying AT to the fine-tuning of pre-trained language models. AT has been shown to improve performance on text classification tasks [36], and it is also effective for fine-tuning and pre-training language models [37]–[39]. It has been empirically demonstrated that AT is effective when applied to a BERT model [40].…”
Section: B. Adversarial Training (mentioning)
confidence: 99%
“…Their research shows that RoBERTa-Large also performs 1–2% better than the RoBERTa-Large baseline. In reference 17, the authors propose a novel method called CRAL (Consistent Regularization for Adaptation Learning) for performing domain adaptation. The approach creates two distinct shared latent spaces, performs domain alignment for each space, and penalizes any inconsistency between the two alignments in their predictions for unlabeled data.…”
Section: Adversarial Training (mentioning)
confidence: 99%
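
The consistency penalty in the CRAL description above can be sketched roughly as follows. This is not the CRAL authors' code: it assumes the two shared latent spaces each produce class logits for the same unlabeled batch, and a symmetric KL term stands in for the inconsistency penalty.

```python
# Rough sketch of penalizing prediction inconsistency between two latent spaces
# on unlabeled data (symmetric KL assumed; names are illustrative).
import torch.nn.functional as F

def prediction_consistency_loss(logits_space1, logits_space2):
    logp1 = F.log_softmax(logits_space1, dim=-1)
    logp2 = F.log_softmax(logits_space2, dim=-1)
    kl_12 = F.kl_div(logp1, logp2, reduction="batchmean", log_target=True)  # KL(p2 || p1)
    kl_21 = F.kl_div(logp2, logp1, reduction="batchmean", log_target=True)  # KL(p1 || p2)
    return 0.5 * (kl_12 + kl_21)
```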