2004
DOI: 10.1016/j.cognition.2004.01.006

Seeing to hear better: evidence for early audio-visual interactions in speech identification

Abstract: Lip reading is the ability to partially understand speech by looking at the speaker's lips. It improves the intelligibility of speech in noise when audio-visual perception is compared with audio-only perception. A recent set of experiments showed that seeing the speaker's lips also enhances sensitivity to acoustic information, decreasing the auditory detection threshold of speech embedded in noise [J. Acoust. Soc. Am. 109 (2001) 2272; J. Acoust. Soc. Am. 108 (2000) 1197]. However, detection is different from c…

Cited by 281 publications (121 citation statements)
References 18 publications

Citation statements, ordered by relevance:
“…Congruent audiovisual words are known to be recognized more easily and the visual information provided by the face and mouth before the onset of the auditory signal contributes to the recognition of the uttered word (e.g. Schwartz et al., 2004; van Wassenhove et al., 2005). However, such processing advantages should facilitate the perception of emotional and neutral words in a similar way.…”
Section: Discussion
confidence: 96%
“…Paulmann and Pell, 2011; Paulmann et al., 2009; see Klasen et al., 2012 for a review) and non-emotional speech signals (e.g. Schwartz et al., 2004; van Wassenhove et al., 2005) and is mandatorily processed (e.g. de Gelder and Vroomen, 2000) already during early perceptual processing stages (e.g.…”
Section: Introduction
confidence: 98%
“…Although CI users are able to integrate their visuoauditory signal efficiently and compensate for the loss of spectral information, none of the naïve NH subjects listening to CI stimulations reach the same level of VA supraadditive integration. Altogether, we suggest that CI users have developed specific visuoauditory skills that lead to a powerful utilization of the visual spatiotemporal cues (29) provided by the lip and face movements (10), allowing these patients to reach near-perfect performance in visuoauditory situations. Using our computational model that allows us to avoid ceiling effects in subjects' performance, we confirmed that the performance of CI patients derived not only from higher efficiency in speechreading, but also from the acquisition of a higher skill level in multisensory integration when visual speech information is matched to an impoverished auditory signal.…”
Section: Discussion
confidence: 99%
“…The next question addressed was whether the visual onset cue could be provided by any visible information, even non-speech, or if it was specific to seeing the articulatory gestures through lip movements. Some hints that the effect might be speech-specific are available from previous studies showing that the audibility of speech sounds embedded in noise is improved by seeing coherent lip movements, but that the enhancement is decreased or eliminated if lip movements are replaced by bars going up and down in synchrony with the original lip movements [37,38]. Therefore, in an original experiment reported next, we tested whether the visual onset effect observed by Sato et al. [36] would occur when the lip movements of /paaa/ and /taaa/ were replaced by a vertical bar varying in height.…”
Section: Multimodal Nature Of Verbal Transformations
confidence: 99%