“…Work from our own research group has shown that binocular rivalry competition, spatial and feature-based attention, object-based attentional selection, and visual working memory all lead to powerful modulatory effects in the human primary visual cortex ( Cohen & Tong, 2015 ; Harrison & Tong, 2009 ; Jehee, Brady, & Tong, 2011 ; Kamitani & Tong, 2005 ; Tong & Engel, 2001 ). By contrast, a limitation of feedforward neural network models is their inability to account for top–down effects of attention and other task-based goals ( Kay, Bonnen, Denison, Arcaro, & Barack, 2023 ; Tong, 2018 ). It will be of considerable interest for future studies to explore whether variations in CNN architecture, the incorporation of recurrent or top–down processing, or the expansion of stimuli and methods used for network training can further improve the ability of CNN models to predict the nonlinear response properties of V1.…”