1 2 Numerous factors have been reported to underlie the representation of complex images in high-3 level human visual cortex, including categories (e.g. faces, objects, scenes), animacy, and real-4 world size, but the extent to which this organization is reflected in behavioral judgments of real-5 world stimuli is unclear. Here, we compared representations derived from explicit similarity 6 judgments and ultra-high field (7T) fMRI of human visual cortex for multiple exemplars of a diverse 7 set of naturalistic images from 48 object and scene categories. Behavioral judgements revealed a 8 coarse division between man-made (including humans) and natural (including animals) images, 9 with clear groupings of conceptually-related categories (e.g. transportation, animals), while these 10 conceptual groupings were largely absent in the fMRI representations. Instead, fMRI responses 11 tended to reflect a separation of both human and non-human faces/bodies from all other categories. 12 This pattern yielded a statistically significant, but surprisingly limited correlation between the two 13 representational spaces. Further, comparison of the behavioral and fMRI representational spaces 14 with those derived from the layers of a deep neural network (DNN) showed a strong 15 correspondence with behavior in the top-most layer and with fMRI in the mid-level layers. These 16 results suggest that there is no simple mapping between responses in high-level visual cortex and 17 behavior -each domain reflects different visual properties of the images and responses in high-18 level visual cortex may correspond to intermediate stages of processing between basic visual 19 features and the conceptual categories that dominate the behavioral response. 20 29 highlight that great care must be taken in mapping the response of visual cortex onto behavior, 30 which clearly reflect different information. 31 50 51 Determining how responses in high-level visual cortex relate to behavior is critical for elucidating 52 the functional significance of these regions. For tasks such as identification and categorization, 53 relevant information has been reported in the responses of lOTC and vTC (Kravitz et al., 2013; 54