Research on speech rhythm has been notoriously oblivious to describing actual rhythms in speech. We present here a model of speech rhythm at the sentence level inspired by musical conceptions of meter. We posit that speech is underlain by a basic metricality. However, instead of arguing that speech is isochronous, we propose that utterances can have internal changes of meter, making them “heterometric.” In addition, we see 2 rhythmic devices for obviating the need for meter changes within utterances and thus maintaining the stability of the rhythm. Both of them involve subdivisions of component beats into subbeats: 1) subdivisions into 2’s and 3’s, resulting in duplets and triplets, respectively; and 2) subdivisions according to complex ratios, resulting in polyrhythms. We tested the model acoustically by having a group of 14 participants read unfamiliar sentences aloud and examining the extent to which their timing conformed with the predictions of a priori rhythmic transcriptions of the sentences. The observed patterns of variability in speech timing for these sentences, when measured at the bar level of the transcription, were generally consistent with the musical model.
We present here a musical approach to speech melody, one that takes advantage of the intervallic precision made possible with musical notation. Current phonetic and phonological approaches to speech melody either assign localized pitch targets that impoverish the acoustic details of the pitch contours and/or merely highlight a few salient points of pitch change, ignoring all the rest of the syllables. We present here an alternative model using musical notation, which has the advantage of representing the pitch of all syllables in a sentence as well as permitting a specification of the intervallic excursions among syllables and the potential for group averaging of pitch use across speakers. We tested the validity of this approach by recording native speakers of Canadian English reading unfamiliar test items aloud, spanning from single words to full sentences containing multiple intonational phrases. The fundamental-frequency trajectories of the recorded items were converted from hertz into semitones, averaged across speakers, and transcribed into musical scores of relative pitch. Doing so allowed us to quantify local and global pitch-changes associated with declarative, imperative, and interrogative sentences, and to explore the melodic dynamics of these sentence types. Our basic observation is that speech is atonal. The use of a musical score ultimately has the potential to combine speech rhythm and melody into a unified representation of speech prosody, an important analytical feature that is not found in any current linguistic approach to prosody.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.