2003
DOI: 10.1016/j.wocn.2003.09.005
|View full text |Cite
|
Sign up to set email alerts
|

Temporal properties of spontaneous speech—a syllable-centric perspective

Abstract: Temporal properties associated with the speech signal are potentially important for understanding spoken language. Five hours of spontaneous American English dialogue material (from the SWITCHBOARD corpus) were hand-labeled and segmented at the phonetic-segment level; a fortyfive-minute subset was also manually annotated (at the syllabic level) with respect to stress accent.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

28
288
2
1

Year Published

2005
2005
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 301 publications
(319 citation statements)
references
References 22 publications
28
288
2
1
Order By: Relevance
“…As noted earlier, the amplitude envelope of speech contains a range of modulations at different temporal rates, with the 'modulation spectrum' within the envelope typically showing the highest power between 2-12 Hz (and peaking around 3−5 Hz irrespective of differences in language or speech rate; see Houtgast and Steeneken 1985;Greenberg et al 2003;Greenberg 2006). These observations have been related to the neural encoding of speech by a family of "multi-time resolution models" of speech processing developed in the field of auditory neuroscience (e.g., Poeppel 2003;Hickok and Poeppel 2007;Ghitza and Greenberg 2009;Ghitza 2011).…”
Section: The Brain: Oscillatory Neuronal Entrainment and Speech Encodingmentioning
confidence: 85%
See 1 more Smart Citation
“…As noted earlier, the amplitude envelope of speech contains a range of modulations at different temporal rates, with the 'modulation spectrum' within the envelope typically showing the highest power between 2-12 Hz (and peaking around 3−5 Hz irrespective of differences in language or speech rate; see Houtgast and Steeneken 1985;Greenberg et al 2003;Greenberg 2006). These observations have been related to the neural encoding of speech by a family of "multi-time resolution models" of speech processing developed in the field of auditory neuroscience (e.g., Poeppel 2003;Hickok and Poeppel 2007;Ghitza and Greenberg 2009;Ghitza 2011).…”
Section: The Brain: Oscillatory Neuronal Entrainment and Speech Encodingmentioning
confidence: 85%
“…drive round, pick my children back up" is shown in Figure 1a. As described by Giraud and Poeppel (2012), rhythmic structure in the envelope is given by regular modulations of signal energy over time, which peak for speech at a rate of 3-5 Hz, the "syllable rate" (Greenberg et al 2003).…”
Section: Discrimination Of Amplitude Modulation and Rise Time In Dyslmentioning
confidence: 99%
“…Our choice of AM and FM frequencies was informed by knowledge of the characteristics of natural speech; that is, we chose a theta-band AM frequency in the range of the syllable envelope (20)(21)(22) and a delta-band FM frequency in the range of prosodic contour fluctuations (23). Thus, it might be argued that the observed comodulation of psychophysical performance was specific to our choice of a slow FM and a faster AM.…”
Section: Entrained Neural Phase In Multiple Frequency Bands Comodulatesmentioning
confidence: 99%
“…In particular, we chose an amplitude modulation (AM) rate in the range of the speech syllable envelope (5.075 Hz, refs. [20][21][22] and a slower frequency modulation (FM) rate in the range of prosodic fluctuations (3.1 Hz, ref. 23; Fig.…”
mentioning
confidence: 99%
“…Phonetic descriptions have traditionally focused on either read text or citation-form (i.e., formal, carefully pronounced) speech. However, the phonetic properties of careful speech often differ from spontaneous speech [3][4][5][6][7]. Because speech technology is becoming increasingly focused on real-world applications, it is important to quantitatively analyze spontaneous material.…”
Section: Introductionmentioning
confidence: 99%