Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2023
DOI: 10.18653/v1/2023.acl-long.491
How do humans perceive adversarial text? A reality check on the validity and naturalness of word-based adversarial attacks

Abstract: Natural Language Processing (NLP) models based on Machine Learning (ML) are susceptible to adversarial attacks - malicious algorithms that imperceptibly modify input text to force models into making incorrect predictions. However, evaluations of these attacks ignore the property of imperceptibility or study it under limited settings. This entails that adversarial perturbations would not pass any human quality gate and do not represent real threats to human-checked NLP systems. To bypass this limitation and enab…
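For context, the word-based attacks the paper evaluates typically search for synonym substitutions that flip a victim model's prediction. The sketch below is a minimal, hypothetical illustration of that greedy substitution loop: the toy keyword classifier and the hand-written synonym table stand in for a real victim model and an embedding-based candidate set, and are not the paper's actual attack implementations.

```python
# Minimal sketch of a greedy word-substitution adversarial attack.
# Both the classifier and the synonym table are toy stand-ins, not
# the attacks studied in the paper (e.g., TextFooler-style methods).

SYNONYMS = {  # hypothetical substitute candidates per word
    "terrible": ["awful", "dreadful"],
    "boring": ["dull", "tedious"],
}

NEGATIVE_WORDS = {"terrible", "boring", "awful"}  # toy sentiment lexicon


def predict(text: str) -> str:
    """Toy victim classifier: negative if any lexicon word appears."""
    tokens = text.lower().split()
    return "negative" if any(t in NEGATIVE_WORDS for t in tokens) else "positive"


def attack(text: str):
    """Try synonym substitutions one word at a time until the label flips."""
    original_label = predict(text)
    tokens = text.split()
    for i, tok in enumerate(tokens):
        for substitute in SYNONYMS.get(tok.lower(), []):
            candidate = tokens[:i] + [substitute] + tokens[i + 1:]
            candidate_text = " ".join(candidate)
            if predict(candidate_text) != original_label:
                return candidate_text  # adversarial example found
    return None  # no label-flipping substitution found


if __name__ == "__main__":
    # "dreadful" is outside the toy lexicon, so this substitution flips
    # the prediction while (arguably) preserving the sentence's meaning.
    print(attack("the plot was terrible"))  # -> "the plot was dreadful"
```

The paper's point is precisely that such substitutions are often not as imperceptible to humans as automated metrics suggest, which is why human validity and naturalness checks matter.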

Cited by 1 publication · References 19 publications (24 reference statements)