Background This article focuses on communication in team-based esports, particularly in the ways that callouts enable players in team-based First-Person Shooters (FPS) to collaboratively link their own perception and awareness of in-game actions to that of their teammates. Callouts are short, community-based utterances that players use to communicate vital details of fast-paced action in competitive games. Aim We provide an empirically-based theorization of why callouts appear to be especially important in team-based FPS games, which, because of the limited fields of vision and split-second decision-making, require players to communicate what is happening to the others in the team as they navigate the game environment. Methods To describe this distributed perception, we borrow from studies on active military settings that term this seeing together as interperceptivity and employ ethnomethodology in our analysis of the minute details of players’ actions in the screen recordings as they extended their team’s collective perception and awareness of in-game activities and events. Results Through this paper, we contribute to the ongoing research on understanding communication and collaboration in team-based games. The callout sequences (and aligning actions) are orienting towards sharing individual perceptions for the (co)construction of an interperceptivity of in-game activities. Hence, callouts form a precondition for coordinated play. Conclusion The introduction of this concept to game studies can help in making sense of a key capability in networked team-based games; that is, how players collectively construct a situational awareness that encompasses teammates’ perception. Also, because of the essential role of callouts and interperceptivity in highly-skilled networked play, we point to some of the cultural contexts in which this practice is accomplished.