2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2017
DOI: 10.1109/cvprw.2017.282
|View full text |Cite
|
Sign up to set email alerts
|

Facial Expression Recognition Using Enhanced Deep 3D Convolutional Neural Networks

Abstract: Deep Neural Networks (DNNs) have shown to outperform traditional methods in various visual recognition tasks including Facial Expression Recognition (FER). In spite of efforts made to improve the accuracy of FER systems using DNN, existing methods still are not generalizable enough in practical applications. This paper proposes a 3D Convolutional Neural Network method for FER in videos. This new network architecture consists of 3D Inception-ResNet layers followed by an LSTM unit that together extracts the spat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
151
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 223 publications
(152 citation statements)
references
References 64 publications
0
151
0
1
Order By: Relevance
“…Connie et al [4] employed SIFT and dense SIFT, while Kaya et al [15] employed SIFT, HOG and Local Gabor Binary Patterns (LGBP). While most works studied facial expression recognition from static images as we do here, some works approached facial expression recognition in video [11,15].…”
Section: Related Artmentioning
confidence: 99%
“…Connie et al [4] employed SIFT and dense SIFT, while Kaya et al [15] employed SIFT, HOG and Local Gabor Binary Patterns (LGBP). While most works studied facial expression recognition from static images as we do here, some works approached facial expression recognition in video [11,15].…”
Section: Related Artmentioning
confidence: 99%
“…With the advent of computer vision and speech recognition, research on emotion recognition using facial expression and speech has gained prevalence [6] [7]. Hasani and Mahoor [25] proposed an enhanced neural network architecture that consists of a 3D version of the Inception-ResNet network followed by a long short-term memory (LSTM) unit for emotion recognition from facial expressions in videos. They employed four databases in classifying different emotions, including anger, fear, disgust, sadness, neutrality, contempt, happiness, and surprise.…”
Section: Emotion Recognitionmentioning
confidence: 99%
“…Later on in [11], different combination of components both on F and the shortcut was investigated. Hasani et al [7] proposed a 3D ResNet based model for the task of facial expression recognition in which the shortcut was replaced with element-wise multiplication of the weight function ω and the input layer x l as follows:…”
Section: A Breg-netmentioning
confidence: 99%