Lip Kinematics for /p/ and /b/ Production during Whispered and Voiced Speech

Higashikawa, Masahiko; Green, Jordan R.; Moore, Christopher A.; Minifie, Fred D.

doi:10.1159/000068059

Cited by 18 publications

(8 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The noise-excited stimuli were thus generated from the same spectrotemporal envelope used for harmonic and inharmonic speech, just with a different excitation signal. In this respect, the stimuli differed somewhat from actual whispering, in which speakers may change their articulation relative to normal speaking 67 , 68 , and for which the spectral envelope is known to change relative to speech with normal voicing 37 , 38 .…”

Section: Methodsmentioning

confidence: 99%

Inharmonic speech reveals the role of harmonicity in the cocktail party problem

et al. 2018

View full text Add to dashboard Cite

The “cocktail party problem” requires us to discern individual sound sources from mixtures of sources. The brain must use knowledge of natural sound regularities for this purpose. One much-discussed regularity is the tendency for frequencies to be harmonically related (integer multiples of a fundamental frequency). To test the role of harmonicity in real-world sound segregation, we developed speech analysis/synthesis tools to perturb the carrier frequencies of speech, disrupting harmonic frequency relations while maintaining the spectrotemporal envelope that determines phonemic content. We find that violations of harmonicity cause individual frequencies of speech to segregate from each other, impair the intelligibility of concurrent utterances despite leaving intelligibility of single utterances intact, and cause listeners to lose track of target talkers. However, additional segregation deficits result from replacing harmonic frequencies with noise (simulating whispering), suggesting additional grouping cues enabled by voiced speech excitation. Our results demonstrate acoustic grouping cues in real-world sound segregation.

show abstract

Section: Methodsmentioning

confidence: 99%

Inharmonic speech reveals the role of harmonicity in the cocktail party problem

et al. 2018

View full text Add to dashboard Cite

show abstract

“…8 Recent research shows the role of lip kinematics in production of whispered plosives, supporting the suggestion that whispered speech and voiced speech rely on distinct motor control processes. 9 Due to these differences in the generation mechanism, the acoustic characteristics of whispered speech are different from those of normal speech.…”

Section: A Short Review Of Whispered Speech Investigationmentioning

confidence: 99%

Acoustic Analysis of Consonants in Whispered Speech

Jovičić¹,

Šarić²

2008

Journal of Voice

View full text Add to dashboard Cite

“…The study revealed that the area of contact between the palate and the tongue during the production of whispered /z/ is larger compared to that during whispered /s/. The differences in the movements of the lips during the production of whispered and neutral bilabial consonants, /b/ and /p/, were studied using both speech and facial video (Higashikawa et al, 2003). The study revealed that the average peak opening and closing velocities and the distance between the upper and the lower lip for oral opening for /b/ were significantly higher than those for /p/ while whispering.…”

mentioning

confidence: 99%

“…The study revealed that the average peak opening and closing velocities and the distance between the upper and the lower lip for oral opening for /b/ were significantly higher than those for /p/ while whispering. These studies show that exaggerated articulation occurs during the production of "voiced" whispered consonants [/z/ and /b/ from Yoshioka (2008) and a) Electronic mail: nishag@iisc.ac.in Higashikawa et al (2003), respectively]. Electro-palatography based experiments with neutral and whispered alveolar consonants, namely, /d/, /t/, and /n/, were done by Osfar (2011).…”

mentioning

confidence: 99%

Reconstruction of articulatory movements during neutral speech from those during whispered speech

Meenakshi

Ghosh

2018

The Journal of the Acoustical Society of America

View full text Add to dashboard Cite

A transformation function (TF) that reconstructs neutral speech articulatory trajectories (NATs) from whispered speech articulatory trajectories (WATs) is investigated, such that the dynamic time warped (DTW) distance between the transformed whispered and the original neutral articulatory movements is minimized. Three candidate TFs are considered: an affine function with a diagonal matrix ( A) which reconstructs one NAT from the corresponding WAT, an affine function with a full matrix ( A) and a deep neural network (DNN) based nonlinear function which reconstruct each NAT from all WATs. Experiments reveal that the transformation could be approximated well by A, since it generalizes better across subjects and achieves the least DTW distance of 5.20 (±1.27) mm (on average), with an improvement of 7.47%, 4.76%, and 7.64% (relative) compared to that with A, DNN, and the best baseline scheme, respectively. Further analysis to understand the differences in neutral and whispered articulation reveals that the whispered articulators exhibit exaggerated movements in order to reconstruct the lip movements during neutral speech. It is also observed that among the articulators considered in the study, the tongue exhibits a higher precision and stability while whispering, implying that subjects control their tongue movements carefully in order to render an intelligible whispered speech.

show abstract

Lip Kinematics for /p/ and /b/ Production during Whispered and Voiced Speech

Cited by 18 publications

References 25 publications

Inharmonic speech reveals the role of harmonicity in the cocktail party problem

Inharmonic speech reveals the role of harmonicity in the cocktail party problem

Acoustic Analysis of Consonants in Whispered Speech

Reconstruction of articulatory movements during neutral speech from those during whispered speech

Contact Info

Product

Resources

About