2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01007
|View full text |Cite
|
Sign up to set email alerts
|

Understanding Robustness of Transformers for Image Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

6
107
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 275 publications
(114 citation statements)
references
References 15 publications
6
107
1
Order By: Relevance
“…Also, our findings are in line with studies in non-medical domains which analyzed robustness of ViTs in technical benchmark tasks. 50,51…”
Section: Discussionmentioning
confidence: 99%
“…Also, our findings are in line with studies in non-medical domains which analyzed robustness of ViTs in technical benchmark tasks. 50,51…”
Section: Discussionmentioning
confidence: 99%
“…Most existing decision-based attack algorithms are dense attacks (the objective is to minimise L 2 or L ∞ distortion). Interestingly, these methods, including BA (Brendel et al, 2018), HSJA (Chen et al, 2020), QEBA (Li et al, 2020), NLBA (Li et al, 2021), PSBA (Zhang et al, 2021), Sign- OPT Cheng et al (2020) or the covariance matrix adaptation evolution strategy (CMA-ES) based method for face recognition tasks in (Dong et al, 2019), can be adapted to a sparse attack setting by a projection to L 0 -ball; however this is not effective, as we show later in Appendix A.7. Although CMA-ES (Dong et al, 2019) is an evolutionary algorithm, albeit for a dense attack, the formulation requires individuals of a population to be real number vectors that can be sampled from a Gaussian distribution.…”
Section: White-box Methodsmentioning
confidence: 99%
“…In contrast, Pointwise employs a purely random method to select the pixel dimension and position i, j to alter. We are motivated to investigate if decision-based dense attacks (L 2 and L ∞ constrained) such as BA (Brendel et al, 2018), HSJA (Chen et al, 2020), QEBA (Li et al, 2020), NLBA (Li et al, 2021), PSBA (Zhang et al, 2021), SignOPT (Cheng et al, 2020) or RayS (Chen & Gu, 2020) can be adapted to a sparse setting by a projection to L 0 -ball. This idea is promising because PGD can be successfully adapted to a sparse setting to provide a sparse attack algorithm in a whitebox setting.…”
Section: A4 Robustness Of Sparse Attacks Against An Adversarially Tra...mentioning
confidence: 99%
See 1 more Smart Citation
“…Investigating the robustness of pre-trained ViT is an active research area and there are several concurrent papers exploring robustness of ViT to adversarial perturbations and common corruptions (ImageNet-C). Bhojanapalli et al [2021] show the robustness of pre-trained transformers to input perturbations, and of transformers to layer removal. Caron et al [2021] demonstrate many emerging properties in self-supervised ViTs, while Shao et al…”
Section: Pre-training Neural Networkmentioning
confidence: 99%