Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-long.377
|View full text |Cite
|
Sign up to set email alerts
|

Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution

Abstract: Recent studies show that neural natural language processing (NLP) models are vulnerable to backdoor attacks. Injected with backdoors, models perform normally on benign examples but produce attacker-specified predictions when the backdoor is activated, presenting serious security threats to real-world applications. Since existing textual backdoor attacks pay little attention to the invisibility of backdoors, they can be easily detected and blocked. In this work, we present invisible backdoors that are activated… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
5

Relationship

3
7

Authors

Journals

citations
Cited by 69 publications
(40 citation statements)
references
References 45 publications
0
27
0
Order By: Relevance
“…Sememe knowledge bases like HowNet (Dong and Dong, 2006) use a set of predefined sememes to annotate words, so that the meaning of a word can be precisely expressed by its sememes. With the help of such sememe knowledge bases, sememes have been successfully utilized in various NLP tasks (Qi et al, 2021a), including semantic composition (Qi et al, 2019), word sense disambiguation (Hou et al, 2020), reverse dictionary (Zhang et al, 2020a), backdoor learning (Qi et al, 2021b), etc.…”
Section: Incorporation Of Sememesmentioning
confidence: 99%
“…Sememe knowledge bases like HowNet (Dong and Dong, 2006) use a set of predefined sememes to annotate words, so that the meaning of a word can be precisely expressed by its sememes. With the help of such sememe knowledge bases, sememes have been successfully utilized in various NLP tasks (Qi et al, 2021a), including semantic composition (Qi et al, 2019), word sense disambiguation (Hou et al, 2020), reverse dictionary (Zhang et al, 2020a), backdoor learning (Qi et al, 2021b), etc.…”
Section: Incorporation Of Sememesmentioning
confidence: 99%
“…They are usually very natural and fluent, thus barely distinguishable from normal samples. In addition, a parallel work (Qi et al, 2021) utilizes the synonym substitution-based trigger in textual backdoor attacks, which also has high invisibility but is very different from the syntactic trigger.…”
Section: Backdoor Attacksmentioning
confidence: 99%
“…'mm', 'bb' and 'James Bond', that can then be easily detected at test time. In [114] and [115], a less detectable trigger is used by relying on a proper combination of synonyms and syntaxes.…”
Section: B Extension To Domains Other Than Computer Visionmentioning
confidence: 99%