2020
DOI: 10.1101/2020.06.16.154542
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations

Abstract: Current state-of-the-art object recognition models are largely based on convolutional neural network (CNN) architectures, which are loosely inspired by the primate visual system. However, these CNNs can be fooled by imperceptibly small, explicitly crafted perturbations, and struggle to recognize objects in corrupted images that are easily recognized by humans. Here, by making comparisons with primate neural data, we first observed that CNN models with a neural hidden layer that better matches primate primary v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

14
138
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 120 publications
(152 citation statements)
references
References 53 publications
14
138
0
Order By: Relevance
“…Crucially, in our work we provide a direct link of the necessity of noise for systems that aim at optimizing decision behavior under our encoding and limited-capacity assumptions, which can be seen as algorithmic specifications of the more realistic population coding specifications mentioned above ( Nikitin et al, 2009 ). We argue that our results may provide a formal intuition for the apparent necessity of noise for improving training and learning performance in artificial neural networks ( Dapello et al, 2020 ; Findling and Wyart, 2020 ), and we speculate that an implementation of 'the right' noise distribution for a given environmental statistic could be seen as a potential mechanism to improve performance in capacity-limited agents generally speaking ( Garrett et al, 2011 ). We acknowledge that based on the results of our work, we cannot confirm whether this is the case for higher order neural circuits, however, we leave it as an interesting theoretical formulation, which could be addressed in future work.…”
Section: Discussionmentioning
confidence: 72%
“…Crucially, in our work we provide a direct link of the necessity of noise for systems that aim at optimizing decision behavior under our encoding and limited-capacity assumptions, which can be seen as algorithmic specifications of the more realistic population coding specifications mentioned above ( Nikitin et al, 2009 ). We argue that our results may provide a formal intuition for the apparent necessity of noise for improving training and learning performance in artificial neural networks ( Dapello et al, 2020 ; Findling and Wyart, 2020 ), and we speculate that an implementation of 'the right' noise distribution for a given environmental statistic could be seen as a potential mechanism to improve performance in capacity-limited agents generally speaking ( Garrett et al, 2011 ). We acknowledge that based on the results of our work, we cannot confirm whether this is the case for higher order neural circuits, however, we leave it as an interesting theoretical formulation, which could be addressed in future work.…”
Section: Discussionmentioning
confidence: 72%
“…But here too, if machines were burdened with humanlike visual acuity and so could barely represent the high-frequency features in the training set (i.e., the features most distorted by this sort of noise), they may be less sensitive to the patterns that later mislead them (74). Indeed, recent work finds that giving CNNs a humanlike fovea (75) or a hidden layer simulating V1 (76)…”
Section: Limit Machines Like Humansmentioning
confidence: 99%
“…In our case, the human constraint – limited visual acuity – could play a vital role in incorporating LSF information for object categorisation. In fact, modelling a human fovea (Deza & Konkle, 2020) or primary visual cortex (Dapello et al, 2020) at the front of CNNs can increase their robustness to adversarial examples. Note that adversarial examples usually include subtle changes to images at high spatial frequencies.…”
Section: Discussionmentioning
confidence: 99%