To benefit from group living, individuals need to maintain cohesion and coordinate their activities. Effective communication thus becomes critical, facilitating rapid coordination of behaviours and reducing consensus costs when group members have differing needs and information. In many bird and mammal species, collective decisions rely on acoustic signals in some contexts but on movement cues in others. Yet there is currently no clear conceptual framework that predicts when decisions should evolve to be based on acoustic signals versus movement cues. Here, we first review how acoustic signals and movement cues are used for coordinating activities. We then outline how information masking, discrimination ability (Weber’s Law), and encoding limitations, as well as trade-offs between these, can identify which types of collective behaviours are likely to rely on acoustic signals and which on movement cues. Specifically, our framework proposes that behaviours involving the timing of events or the expression of specific actions should rely more on acoustic signals, whereas decisions involving complex choices among multiple options (e.g. direction, destination) should generally rely on movement cues, because sounds are more vulnerable to information masking and Weber’s Law effects. Finally, we discuss potential future avenues, including multimodal communication and collective decision-making by mixed-species animal groups.