Developmental Trajectory of Audiovisual Speech Integration in Early Infancy. A Review of Studies Using the McGurk Paradigm

Tomalski, Przemysław

doi:10.1515/plc-2015-0006

Cited by 12 publications

(10 citation statements)

References 133 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Although a growing body of evidence demonstrates that substantial fine-tuning for various forms of audiovisual processing continues throughout childhood and well into adolescence (Baart et al, 2015 ; Tomalski, 2015 ), suffice it to say that at least some primitive form of multimodal perception emerges in early infancy (Bahrick et al, 2004 ). This can be characterized as guided by both modal cues (i.e., those that are specific to a single modality, such as color information in the visual domain or the timbre of someone's voice in the auditory domain) and amodal ones (i.e., those that are available across modalities and are thus redundant; Bahrick, 1988 ).…”

Section: Introductionmentioning

confidence: 99%

Sources of Confusion in Infant Audiovisual Speech Perception Research

Shaw

Bortfeld

2015

Front. Psychol.

View full text Add to dashboard Cite

Speech is a multimodal stimulus, with information provided in both the auditory and visual modalities. The resulting audiovisual signal provides relatively stable, tightly correlated cues that support speech perception and processing in a range of contexts. Despite the clear relationship between spoken language and the moving mouth that produces it, there remains considerable disagreement over how sensitive early language learners—infants—are to whether and how sight and sound co-occur. Here we examine sources of this disagreement, with a focus on how comparisons of data obtained using different paradigms and different stimuli may serve to exacerbate misunderstanding.

show abstract

Section: Introductionmentioning

confidence: 99%

Sources of Confusion in Infant Audiovisual Speech Perception Research

Shaw

Bortfeld

2015

Front. Psychol.

View full text Add to dashboard Cite

show abstract

“…Hick et al [34] concluded from their work that SLI children demonstrated slower development of the short term memory. Tomalski [35] reported that human speech is a multisensory experience and the most important modalities for language comprehension and production are visual spatial modalities. They reported that integrity of the social pragmatics aspects resulted from adequacy of audiovisual processing of the speech.…”

Section: Assessment Of Visual Working Memory (Wm)mentioning

confidence: 99%

Panorama of the Non-Verbal Cognitive Abilities Among Children with SLI

Fahiem

Mohammed

2020

Egyptian Journal of Ear, Nose, Throat and Allied Sciences

View full text Add to dashboard Cite

“…Several studies with infants as young as four months found that their pattern of illusory phoneme detection was similar to that of adults [13,28]. However, other studies have found different results in male and female infants depending on the experimental design [29,30], showing that the effect is not as strong or consistent as in adults. Therefore, the literature suggests that infants are sensitive to cases of temporal and phonological audiovisual synchrony, suggesting a possible innate audiovisual integration skill.…”

Section: Introductionmentioning

confidence: 99%

Modeling the Development of Audiovisual Cue Integration in Speech Perception

et al. 2017

View full text Add to dashboard Cite

Adult speech perception is generally enhanced when information is provided from multiple modalities. In contrast, infants do not appear to benefit from combining auditory and visual speech information early in development. This is true despite the fact that both modalities are important to speech comprehension even at early stages of language acquisition. How then do listeners learn how to process auditory and visual information as part of a unified signal? In the auditory domain, statistical learning processes provide an excellent mechanism for acquiring phonological categories. Is this also true for the more complex problem of acquiring audiovisual correspondences, which require the learner to integrate information from multiple modalities? In this paper, we present simulations using Gaussian mixture models (GMMs) that learn cue weights and combine cues on the basis of their distributional statistics. First, we simulate the developmental process of acquiring phonological categories from auditory and visual cues, asking whether simple statistical learning approaches are sufficient for learning multi-modal representations. Second, we use this time course information to explain audiovisual speech perception in adult perceivers, including cases where auditory and visual input are mismatched. Overall, we find that domain-general statistical learning techniques allow us to model the developmental trajectory of audiovisual cue integration in speech, and in turn, allow us to better understand the mechanisms that give rise to unified percepts based on multiple cues.

show abstract

Developmental Trajectory of Audiovisual Speech Integration in Early Infancy. A Review of Studies Using the McGurk Paradigm

Cited by 12 publications

References 133 publications

Sources of Confusion in Infant Audiovisual Speech Perception Research

Sources of Confusion in Infant Audiovisual Speech Perception Research

Panorama of the Non-Verbal Cognitive Abilities Among Children with SLI

Modeling the Development of Audiovisual Cue Integration in Speech Perception

Contact Info

Product

Resources

About