2022
DOI: 10.1121/10.0009844
|View full text |Cite
|
Sign up to set email alerts
|

Pushing the envelope: Evaluating speech rhythm with different envelope extraction techniques

Abstract: The amplitude of the speech signal varies over time, and the speech envelope is an attempt to characterise this variation in the form of an acoustic feature. Although tacitly assumed, the similarity between the speech envelope-derived time series and that of phonetic objects (e.g., vowels) remains empirically unestablished. The current paper, therefore, evaluates several speech envelope extraction techniques, such as the Hilbert transform, by comparing different acoustic landmarks (e.g., peaks in the speech en… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 90 publications
0
2
0
Order By: Relevance
“…Audio was filtered to conform to the international long-term average speech spectrum (Byrne et al, 1994; Figure 1, Panel A). Long pauses ( > 500 ms) were manually restricted to 500 ms. Syllabic timing was estimated by windowing the speech recordings into 2 s chunks and automatically detecting vowel onset-like acoustic landmarks (MacIntyre et al, 2022). The grand average inter-vowel onset interval was, for English, 203.47 ms (SD 47.11 ms) and for Dutch, 202.68 ms (SD 38.96 ms).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Audio was filtered to conform to the international long-term average speech spectrum (Byrne et al, 1994; Figure 1, Panel A). Long pauses ( > 500 ms) were manually restricted to 500 ms. Syllabic timing was estimated by windowing the speech recordings into 2 s chunks and automatically detecting vowel onset-like acoustic landmarks (MacIntyre et al, 2022). The grand average inter-vowel onset interval was, for English, 203.47 ms (SD 47.11 ms) and for Dutch, 202.68 ms (SD 38.96 ms).…”
Section: Methodsmentioning
confidence: 99%
“…Long pauses (> 500 ms) were manually removed. Syllabic timing was estimated by windowing the speech recordings into 2 s chunks and automatically detecting vowel onset-like acoustic landmarks [84]. The grand average inter-vowel onset interval was, for English, 203.47 ms (SD 47.11 ms) and for Dutch, 202.68 ms (SD 38.96 ms).…”
Section: Recording Preprocessing and Editing Of Audiomentioning
confidence: 99%
“…To operationalize speech effort, we extracted the speech amplitude envelope from the audio recordings of the interactions for each participant (Pouw & Trujillo, 2021), with a smoothing Hanning filter of 5Hz and a resampling rate of 100Hz. We then counted the number of envelope peaks per trial as a proxy for the number of syllables (see e.g., MacIntyre et al, 2022), with a peak height threshold of 0.37 (set by M-(2*SD) envelope height over all participants). Gesture Effort.…”
Section: Measures Of Communicative Effortmentioning
confidence: 99%
“…For the R code to extract the amplitude envelope from speech please see Pouw and Trujillo (2019). For our analysis, we also take the first derivative of the amplitude envelope (the change in amplitude envelope) to measure the peak attack phase of the signal (i.e., sudden rises of the amplitude envelope; see e.g., MacIntyre, Cai, & Scott, 2022).…”
Section: Acoustic and Motion Trackingmentioning
confidence: 99%