2022
DOI: 10.48550/arxiv.2203.07983
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Adversarial Robustness of Neural-Statistical Features in Detection of Generative Transformers

Abstract: The detection of computer-generated text is an area of rapidly increasing significance as nascent generative models allow for efficient creation of compelling human-like text, which may be abused for the purposes of spam, disinformation, phishing, or online influence campaigns. Past work has studied detection of current state-of-the-art models, but despite a developing threat landscape, there has been minimal analysis of the robustness of detection methods to adversarial attacks. To this end, we evaluate neura… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…Most work on automated detection targeted GPT-2-generated text, although Bakhtin et al (2019), Uchendu et al (2020), Fagni et al (2021) and Stiff and Johansson (2021) also experimented with other generators. Finally, there is a growing body of work on the adversarial robustness of automated detectors of synthetic text (Wolff, 2020;Bhat and Parthasarathy, 2020;Stiff and Johansson, 2021;Crothers et al, 2022). A survey on the automatic detection of synthetic text can be found in (Jawahar et al, 2020).…”
Section: Related Workmentioning
confidence: 99%
“…Most work on automated detection targeted GPT-2-generated text, although Bakhtin et al (2019), Uchendu et al (2020), Fagni et al (2021) and Stiff and Johansson (2021) also experimented with other generators. Finally, there is a growing body of work on the adversarial robustness of automated detectors of synthetic text (Wolff, 2020;Bhat and Parthasarathy, 2020;Stiff and Johansson, 2021;Crothers et al, 2022). A survey on the automatic detection of synthetic text can be found in (Jawahar et al, 2020).…”
Section: Related Workmentioning
confidence: 99%