Sathya Buršić scite author profile

When automatic facial expression recognition is applied to video sequences of speaking subjects, recognition accuracy has been noted to be lower than with video sequences of still subjects. This effect, known as the speaking effect, arises during spontaneous conversations, in which the speech articulation process influences facial configurations along with the affective expressions. In this work we ask whether, aside from facial features, other cues relating to the articulation process increase emotion recognition accuracy when added as input to a deep neural network model. We develop two neural networks that classify facial expressions in speaking subjects from the RAVDESS dataset: a spatio-temporal CNN and a GRU-cell RNN. They are first trained on facial features only, and afterwards on both facial features and articulation-related cues extracted from a model trained for lip reading, while also varying the number of consecutive frames provided as input. We show that, with DNNs, adding articulation-related features increases classification accuracy by up to 12%, with a greater increase when more consecutive frames are provided as input to the model.
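The input preparation described in the abstract (facial features combined with articulation cues, over a variable number of consecutive frames) might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the two feature streams are combined, so per-frame concatenation is assumed here, and the names `fuse_frames` and `sliding_windows`, the stride of 1, and the toy feature sizes are all hypothetical.

```python
from typing import List, Sequence

Vector = List[float]


def fuse_frames(face: Sequence[Vector], artic: Sequence[Vector]) -> List[Vector]:
    """Concatenate per-frame facial and articulation feature vectors.

    Assumption: simple concatenation; the source does not specify the fusion.
    """
    if len(face) != len(artic):
        raise ValueError("both streams need one vector per frame")
    return [list(f) + list(a) for f, a in zip(face, artic)]


def sliding_windows(frames: Sequence[Vector], n_frames: int) -> List[List[Vector]]:
    """Return every run of n_frames consecutive frames (stride 1, assumed)."""
    if n_frames < 1:
        raise ValueError("n_frames must be positive")
    return [list(frames[i:i + n_frames]) for i in range(len(frames) - n_frames + 1)]


# Toy data: 5 frames, 2 facial features and 1 articulation feature per frame.
face = [[0.1 * t, 0.2 * t] for t in range(5)]
artic = [[float(t)] for t in range(5)]

fused = fuse_frames(face, artic)
windows = sliding_windows(fused, n_frames=3)

# Number of windows, frames per window, features per frame.
print(len(windows), len(windows[0]), len(windows[0][0]))  # prints: 3 3 3
```

Each window would then be one input sequence for a temporal model such as the CNN or RNN mentioned above; changing `n_frames` corresponds to varying the number of consecutive frames provided as input.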


A Quantitative Evaluation Framework of Video De-Identification Methods

Buršić, D’Amelio, Granato, et al. 2021


Yes, Echo-Chambers Mislead You Too: A Game-Based Educational Experience to Reveal the Impact of Social Media Personalization Algorithms

Lomonaco, Taibi, Trianni, et al. 2023


Real-time face mask position recognition system based on MobileNet model

et al. 2023




Sathya Buršić

Anomaly Detection from Log Files Using Unsupervised Deep Learning

Improving the Accuracy of Automatic Facial Expression Recognition in Speaking Subjects with Deep Learning

