2021
DOI: 10.1121/10.0005938
|View full text |Cite
|
Sign up to set email alerts
|

The effects of lexical content, acoustic and linguistic variability, and vocoding on voice cue perception

Abstract: Perceptual differences in voice cues, such as fundamental frequency (F0) and vocal tract length (VTL), can facilitate speech understanding in challenging conditions. Yet, we hypothesized that in the presence of spectrotemporal signal degradations, as imposed by cochlear implants (CIs) and vocoders, acoustic cues that overlap for voice perception and phonemic categorization could be mistaken for one another, leading to a strong interaction between linguistic and indexical (talker-specific) content. Fifteen norm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
17
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 12 publications
(21 citation statements)
references
References 67 publications
4
17
0
Order By: Relevance
“…The acoustic features of signal may therefore not have been identical across Experiments 1 and 2. However, the acoustic voice cues that have been shown to be most salient for the discrimination between voices of different talkers—fundamental frequency (F0) 39 , 44 46 and vocal tract length 47 —were not altered by Experiment 2. The long term average spectrum 48 , another cue used by listeners to differentiate between voices 49 , also remained consistent across our experiments.…”
Section: Discussionmentioning
confidence: 87%
See 2 more Smart Citations
“…The acoustic features of signal may therefore not have been identical across Experiments 1 and 2. However, the acoustic voice cues that have been shown to be most salient for the discrimination between voices of different talkers—fundamental frequency (F0) 39 , 44 46 and vocal tract length 47 —were not altered by Experiment 2. The long term average spectrum 48 , another cue used by listeners to differentiate between voices 49 , also remained consistent across our experiments.…”
Section: Discussionmentioning
confidence: 87%
“…Though our results strongly implicate pragmatic context in speaker change detection, our experimental design does not allow us so strictly rule out any influence of acoustic cues. Research suggests a strong interaction between top-down lexical and sentential content and indexical (talker-specific) content 39 , 40 : the presence of linguistic context at the word and sentence levels affects participants’ discrimination of voice cues. In order to test the causal role of pragmatic context in speaker change detection and to further rule out any effect of acoustic cues in Experiment 1, we ran a follow up experiment in which we removed all pragmatic information from the same stimuli, leaving acoustic information intact.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the current study, an 8-channel vocoder was used to simulate the same number of electrode contact points in each vocoder condition, but spectral resolution was varied by manipulating the sharpness of the bandpass filter slopes to simulate low, medium, and high spread of excitation in the cochlea. Decreased spectral resolution via simulation of increased channel interaction has been found to result in less accurate speech recognition (Fu and Nogaki, 2005;Bingabr et al, 2008;Wang et al, 2012;Oxenham and Kreft, 2014;Winn et al, 2016;Mehta et al, 2020), less accurate pitch perception (Crew et al, 2012;Mehta and Oxenham, 2017), increased listening effort (Winn et al, 2016), and limitations in the perception of non-linguistic aspects of speech, such as voice cue perception (Gaudrain and Başkent, 2015;Koelewijn et al, 2021). In the current study, while there was a main effect of vocoder condition, only differences between the condition with the worst spectral resolution (HS; 4th order, 24 dB/octave) and the conditions with increasingly more favorable spectral resolutions (MS and LS; 8th order, 48 dB/octave and 12th order, 72 dB/octave, respectively) emerged.…”
Section: Discussionmentioning
confidence: 99%
“…To investigate the effect of spectral resolution on speech clarity, the current study manipulated the sharpness of the slope of the bandpass filters to simulate current spread in the cochlea. The amount of spread of excitation in the cochlea determines the extent to which individual stimulation channels of the implant interact (e.g., Black and Clark, 1980;Bingabr et al, 2008;Gaudrain and Başkent, 2015;Koelewijn et al, 2021). Three vocoder conditions were included to simulate low spread (LS), medium spread (MS), and high spread (HS) of excitation (and decreasing spectral resolution, respectively), in order to obtain varying degrees of degradation.…”
Section: The Current Studymentioning
confidence: 99%