2021
DOI: 10.48550/arxiv.2103.13127
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Black-box Detection of Backdoor Attacks with Limited Information and Data

Abstract: Although deep neural networks (DNNs) have made rapid progress in recent years, they are vulnerable in adversarial environments. A malicious backdoor could be embedded in a model by poisoning the training dataset, whose intention is to make the infected model give wrong predictions during inference when the specific trigger appears. To mitigate the potential threats of backdoor attacks, various backdoor detection and defense methods have been proposed. However, the existing techniques usually require the poison… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(17 citation statements)
references
References 14 publications
0
17
0
Order By: Relevance
“…Consistent with prior studies (Dong et al, 2021;Kolouri et al, 2020;, we deem a DNN is backdoor infected if one can make an arbitrary input misclassified as the target label, with minor modification to the input. Without loss of generability, given the original input x ∈ R n , the modified input containing the backdoor trigger can be formulated as:…”
Section: Problem Definitionmentioning
confidence: 93%
See 4 more Smart Citations
“…Consistent with prior studies (Dong et al, 2021;Kolouri et al, 2020;, we deem a DNN is backdoor infected if one can make an arbitrary input misclassified as the target label, with minor modification to the input. Without loss of generability, given the original input x ∈ R n , the modified input containing the backdoor trigger can be formulated as:…”
Section: Problem Definitionmentioning
confidence: 93%
“…However, their method still need the DNN's parameters to train a separate generator (Goodfellow et al, 2014). So, strictly speaking, their method is not "black-box", which is also revealed by (Dong et al, 2021). To the best of our knowledge, Dong et al (2021) is the only existing work on detecting backdoor-infected DNNs in the black-box settings.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations