Adaptation to spectrally-rotated speech

Green, Tim; Rosen, Stuart; Faulkner, Andrew; Paterson, Ruth

doi:10.1121/1.4812759

Cited by 14 publications

(8 citation statements)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…It has a largely unchanged pitch profile, where some vowels remain relatively unchanged and some voice and manner cues are preserved. However, it is still unintelligible without significant training (Azadpour & Balaban, 2008; Blesser, 1972; Green, Rosen, Faulkner, & Paterson, 2013).…”

Section: Methodsmentioning

confidence: 99%

Getting the Cocktail Party Started: Masking Effects in Speech Perception

Evans

McGettigan

Agnew

et al. 2016

Journal of Cognitive Neuroscience

Self Cite

View full text Add to dashboard Cite

Spoken conversations typically take place in noisy environments and different kinds of masking sounds place differing demands on cognitive resources. Previous studies, examining the modulation of neural activity associated with the properties of competing sounds, have shown that additional speech streams engage the superior temporal gyrus. However, the absence of a condition in which target speech was heard without additional masking made it difficult to identify brain networks specific to masking and to ascertain the extent to which competing speech was processed equivalently to target speech. In this study, we scanned young healthy adults with continuous functional Magnetic Resonance Imaging (fMRI), whilst they listened to stories masked by sounds that differed in their similarity to speech. We show that auditory attention and control networks are activated during attentive listening to masked speech in the absence of an overt behavioural task. We demonstrate that competing speech is processed predominantly in the left hemisphere within the same pathway as target speech but is not treated equivalently within that stream, and that individuals who perform better in speech in noise tasks activate the left midposterior superior temporal gyrus more. Finally, we identify neural responses associated with the onset of sounds in the auditory environment, activity was found within right lateralised frontal regions consistent with a phasic alerting response. Taken together, these results provide a comprehensive account of the neural processes involved in listening in noise.

show abstract

Section: Methodsmentioning

confidence: 99%

Getting the Cocktail Party Started: Masking Effects in Speech Perception

Evans

McGettigan

Agnew

et al. 2016

Journal of Cognitive Neuroscience

Self Cite

View full text Add to dashboard Cite

show abstract

“…The acoustic signal was first equalized with a filter (essentially high-pass) that gave the rotated signal approximately the same long-term spectrum as the original. This equalizing filter (33-point finite impulse response [FIR]) was constructed based on measurements of the long-term average spectrum of speech ( Byrne et al 1994 ), although the roll-off below 120 Hz was ignored, and a flat spectrum below 420 Hz was assumed ( Scott, Rosen, et al 2009 ; Green et al 2013 ). The equalized signal was then amplitude modulated by a sinusoid at 4 kHz, followed by low-pass filtering at 3.8 kHz.…”

Section: Methodsmentioning

confidence: 99%

Feel the Noise: Relating Individual Differences in Auditory Imagery to the Structure and Function of Sensorimotor Systems

Lima

Lavan

Evans³

et al. 2015

Cereb. Cortex

View full text Add to dashboard Cite

Humans can generate mental auditory images of voices or songs, sometimes perceiving them almost as vividly as perceptual experiences. The functional networks supporting auditory imagery have been described, but less is known about the systems associated with interindividual differences in auditory imagery. Combining voxel-based morphometry and fMRI, we examined the structural basis of interindividual differences in how auditory images are subjectively perceived, and explored associations between auditory imagery, sensory-based processing, and visual imagery. Vividness of auditory imagery correlated with gray matter volume in the supplementary motor area (SMA), parietal cortex, medial superior frontal gyrus, and middle frontal gyrus. An analysis of functional responses to different types of human vocalizations revealed that the SMA and parietal sites that predict imagery are also modulated by sound type. Using representational similarity analysis, we found that higher representational specificity of heard sounds in SMA predicts vividness of imagery, indicating a mechanistic link between sensory- and imagery-based processing in sensorimotor cortex. Vividness of imagery in the visual domain also correlated with SMA structure, and with auditory imagery scores. Altogether, these findings provide evidence for a signature of imagery in brain structure, and highlight a common role of perceptual–motor interactions for processing heard and internally generated auditory information.

show abstract

“…Support for this assumption comes from observations that even after years of experience with a second language, non-native speakers still have speech perception difficulties in adverse listening conditions (e.g., Conrad, 1989; and see Lecumberri et al, 2010, for review). Nevertheless, a prolonged learning phase has been documented in a small, but growing, number of training studies on distorted speech in which participants experienced many hundreds (and even thousands) of trials, sometimes across multiple training sessions (Stacey and Summerfield, 2008;Song et al, 2012;Green et al, 2013). The goal of the current study was therefore to explicitly test the hypothesis that longer training on a speech identification task can yield more learning and generalization than shorter-term training.…”

Section: Introductionmentioning

confidence: 98%

The effects of training length on the perceptual learning of time-compressed speech and its generalization

Banai

Lavner

2014

The Journal of the Acoustical Society of America

View full text Add to dashboard Cite

Brief exposure to time-compressed speech yields both learning and generalization. Whether such learning continues over the course of multi-session training and if so whether it is more or less specific than exposure-induced learning is not clear, because the outcomes of intensive practice with time-compressed speech have rarely been reported. The goal here was to determine whether prolonged training on time-compressed speech yields additional learning and generalization beyond that induced by brief exposure. Listeners practiced the semantic verification of time-compressed sentences for one or three training sessions. Identification of trained and untrained tokens was subsequently compared between listeners who trained for one or three sessions, listeners who were briefly exposed to 20 time-compressed sentences and naive listeners. Trained listeners outperformed the other groups of listeners on the trained condition, but only the group that was trained for three sessions outperformed the other groups when tested with untrained tokens. These findings suggest that although learning of distorted speech can occur rapidly, more stable learning and generalization might be achieved with longer, multi-session practice. It is suggested that the findings are consistent with the framework proposed by the Reverse Hierarchy Theory of perceptual learning.

show abstract

Adaptation to spectrally-rotated speech

Cited by 14 publications

References 43 publications

Getting the Cocktail Party Started: Masking Effects in Speech Perception

Getting the Cocktail Party Started: Masking Effects in Speech Perception

Feel the Noise: Relating Individual Differences in Auditory Imagery to the Structure and Function of Sensorimotor Systems

The effects of training length on the perceptual learning of time-compressed speech and its generalization

Contact Info

Product

Resources

About