We investigate how the population nonlinearities resulting from lateral inhibition and thresholding in sparse coding networks influence neural response selectivity and robustness. We show that when compared to pointwise nonlinear models, such population nonlinearities improve the selectivity to a preferred stimulus and protect against adversarial perturbations of the input. These findings are predicted from the geometry of the single-neuron iso-response surface, which provides new insight into the relationship between selectivity and adversarial robustness. Inhibitory lateral connections curve the iso-response surface outward in the direction of selectivity. Since adversarial perturbations are orthogonal to the iso-response surface, adversarial attacks tend to be aligned with directions of selectivity. Consequently, the network is less easily fooled by perceptually irrelevant perturbations to the input. Together, these findings point to benefits of integrating computational principles found in biological vision systems into artificial neural networks.
Any visual system, biological or artificial, must make a trade-off between the number of units used to represent the visual environment and the spatial resolution of the sampling array. Humans and some other animals are able to allocate attention to spatial locations to reconfigure the sampling array of receptive fields (RFs), thereby enhancing the spatial resolution of representations without changing the overall number of sampling units. Here, we examine how representations of visual features in a fully convolutional neural network interact and interfere with each other in an eccentricity-dependent RF pooling array and how these interactions are influenced by dynamic changes in spatial resolution across the array. We study these feature interactions within the framework of visual crowding, a well-characterized perceptual phenomenon in which target objects in the visual periphery that are easily identified in isolation are much more difficult to identify when flanked by similar nearby objects. By separately simulating effects of spatial attention on RF size and on the density of the pooling array, we demonstrate that the increase in RF density due to attention is more beneficial than changes in RF size for enhancing target classification for crowded stimuli. Furthermore, by varying target and flanker spacing, as well as the spatial extent of attention, we find that feature redundancy across RFs has more influence on target classification than the fidelity of the feature representations themselves. Based on these findings, we propose a candidate mechanism by which spatial attention relieves visual crowding through enhanced feature redundancy that is mostly due to increased RF density.
Visual spatial attention can be allocated in two distinct ways: one that is voluntarily directed to behaviorally relevant locations in the world, and one that is involuntarily captured by salient external stimuli. Precueing spatial attention has been shown to improve perceptual performance on a number of visual tasks. However, the effects of spatial attention on visual crowding, defined as the reduction in the ability to identify target objects in clutter, are far less clear. In this study, we used an anticueing paradigm to separately measure the effects of involuntary and voluntary spatial attention on a crowding task. Each trial began with a brief peripheral cue that predicted that the crowded target would appear on the opposite side of the screen 80% of the time and on the same side of the screen 20% of the time. Subjects performed an orientation discrimination task on a target Gabor patch that was flanked by other similar Gabor patches with independent random orientations. For trials with a short stimulus onset asynchrony between cue and target, involuntary capture of attention led to faster response times and smaller critical spacing when the target appeared on the cue side. For trials with a long stimulus onset asynchrony, voluntary allocation of attention led to faster reaction times but no significant effect on critical spacing when the target appeared on the opposite side to the cue. We additionally found that the magnitudes of these cueing effects of involuntary and voluntary attention were not strongly correlated across subjects for either reaction time or critical spacing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.