Our visual environments are composed of an abundance of individual objects. The efficiency with which we can parse such rich environments is remarkable. Previous work suggests that this efficiency is partly explained by grouping mechanisms, which allow the visual system to process the objects that surround us as meaningful groups rather than individual entities. Here, we show that the grouping of objects in typically and meaningfully structured environments directly relates to a reduction of perceived complexity. In an object numerosity discrimination task, we showed participants pairs of schematic scene miniatures, in which objects were structured in typical or atypical ways, and asked them to judge which scene consisted of more individual objects. We obtained two key results: First, participants were less accurate in comparing numerosities between typically structured scenes than between atypically structured scenes, suggesting that grouping processes hinder the effective individuation of separate objects in typically structured scenes. Second, participants underestimated the number of objects in typically structured, compared to atypically structured, scenes, suggesting that grouping based on typical object configurations reduces the perceived numerical complexity of a scene. In a control experiment, we show that this overestimation is specific to upright scenes, indicating that is not related to basic visual feature differences between typically and atypically structured scenes. Together, our results suggest that our visual surroundings appear less complex to the visual system than the number of objects in them makes us believe.