Introduction Grouping and boundary perception are central to the understanding and modelling of core tasks in many areas of cognitive science. They are fundamental processes in, for example, natural language processing (eg speech segmentation and word discoveryöBrent 1999b; Jusczyk 1997), motor learning (eg identifying behavioural episodes öReynolds et al 2007; Newtson 1973), memory storage and retrieval (eg chunkingöKurby and Zacks 2007) and visual perception (eg analysing spatial organisation öMarr 1982). Our focus in this paper is on the perception and cognition of music (Krumhansl 1990; Temperley 2001), where the process by which the human perceptual system groups sequential musical elements together is one of the most fundamental issues. In particular, we examine the grouping of musical elements into contiguous segments that occur sequentially in time or, to put it another way, the identification of boundaries between the final element of one segment and the first element of the subsequent one. This way of structuring a musical surface is usually referred to as grouping (Lerdahl and Jackendoff 1983) or segmentation (Cambouropoulos 2006). We distinguish this kind of perceptual aggregation of auditory elements from the integration, or fusion, of auditory elements that occur simultaneously in time and also from the segregation of parallel auditory streams (Bregman 1990). In musical terms, the kinds of groups we consider correspond with motifs, phrases, sections and other aspects of musical form. We use the term grouping structure to refer to a piece of music structured in this way. It is taken that, just as speech is perceptually segmented into phonemes, and then words which subsequently provide the building blocks for the perception of phrases and complete utterances (Brent 1999b; Jusczyk 1997), motifs or phrases in music are identified by listeners, stored in memory and made available for inclusion in higher-level structural groups (Lerdahl and Jackendoff 1983; Peretz 1989; Tan et al 1981). The lowlevel organisation of the musical surface into groups allows the use of these primitive perceptual units in more complex structural processing and may alleviate demands on memory.