This chapter provides a perspective on how the latest EEG evidence illuminates the neurophysiological and neurocognitive mechanisms underlying the recognition of socioemotional expression conveyed in human speech and voice, drawing upon event-related potential (ERP) studies. The human voice can encode emotional meaning through different vocal parameters in words, real vs. pseudo-speech, and vocalizations. Based on ERP findings, recent developments of the three-stage model of vocal processing have highlighted initial- and late-stage processing of vocal emotional stimuli. These processes, depending on which ERP components they are mapped onto, can be divided into acoustic analysis, relevance and motivational processing, fine-grained meaning analysis/integration/access, and higher-level social inference, as they unfold over time. ERP studies on vocal socioemotions, such as happiness, anger, fear, sadness, neutrality, sincerity, confidence, and sarcasm in the human voice and speech, have employed different experimental paradigms, including cross-splicing, cross-modality priming, oddball, and Stroop tasks. Moreover, task demands and listener characteristics affect the neural responses underlying the decoding processes, revealing the role of attention deployment and interpersonal sensitivity in the neural decoding of vocal emotional stimuli. Cultural orientation also affects our ability to decode emotional meaning in the voice. Neurophysiological patterns have been compared between normal and abnormal emotional processing of vocal expressions, especially in schizophrenia and congenital amusia. Future directions highlight the study of human vocal expression in alignment with other nonverbal cues, such as facial expressions and body language, and the need to synchronize listeners' brain potentials with other peripheral measures.