Crowding refers to the inability to recognize an object in peripheral vision when other objects are presented nearby (Whitney & Levi, 2011). A popular explanation of crowding is that features of the target and flankers are combined inappropriately when they are located within an integration field and thus impair target recognition (Pelli, Palomares, & Majaj, 2004). However, it remains unclear which features of the target and flankers are combined inappropriately to cause crowding (Levi, 2008). For example, in a complex stimulus (e.g., a face), to what extent does crowding result from the integration of features at a part-based level or at the level of global processing of configural appearance? In this study, we used a face categorization task and different types of flankers to examine how much the magnitude of visual crowding depends upon similarity of face parts or similarity of global configurations. We created flankers with face-like features (e.g. the eyes, nose, and mouth) in typical and scrambled configurations to examine the impact of part appearance and global configuration on the visual crowding of faces. Additionally, we used “electrical socket” flankers that mimicked 1st-order face configuration but had only schematic features to examine the extent to which global face geometry impacted crowding. Our results indicate that both face parts and configurations contribute to visual crowding, suggesting that face similarity as realized under crowded conditions includes both aspects of facial appearance.