“…Despite being developed as computer vision tools, DNNs trained to recognise objects in images are also unsurpassed at predicting how natural images are represented in high-level ventral visual areas of the human and non-human primate brain (Agrawal et al, 2014;Bashivan et al, 2019;Cadieu et al, 2014;Cichy et al, 2016;Devereux et al, 2018;Eickenberg et al, 2017;Güçlü & van Gerven, 2015;Horikawa & Kamitani, 2017;Kubilius et al, 2018;Lindsay, 2020;Ponce et al, 2019;Schrimpf et al, 2018;Xu & Vaziri-Pashkam, 2020;Yamins & DiCarlo, 2016). There is some variability in the accuracy with which different recent DNNs can predict high-level visual representations Xu & Vaziri-Pashkam, 2020;Zeman et al, 2020), despite broadly high performance. It remains unclear how strongly network design choices, such as depth, architecture, task training, and subsequent model fitting to neural data may contribute to the observed variations.…”