“…Humans can identify objects that are highly distorted or highly degraded. For instance, we can readily identify images of faces that are stretched by a factor of four (Hacker & Biederman, 2018), when images are partly occluded or presented in novel poses (Biederman, 1987), and when various sorts of visual noise are added to the image (Geirhos et al., 2021). By contrast, CNNs are much worse at generalizing under these conditions (Alcorn et al., 2019; Geirhos et al., 2018, 2021; Wang et al., 2018; Zhu, Tang, Park, Park, & Yuille, 2019).…”