2017 12th IEEE International Conference on Automatic Face &Amp; Gesture Recognition (FG 2017) 2017
DOI: 10.1109/fg.2017.99
|View full text |Cite
|
Sign up to set email alerts
|

Spatio-Temporal Facial Expression Recognition Using Convolutional Neural Networks and Conditional Random Fields

Abstract: Automated Facial Expression Recognition (FER) has been a challenging task for decades. Many of the existing works use hand-crafted features such as LBP, HOG, LPQ, and Histogram of Optical Flow (HOF) combined with classifiers such as Support Vector Machines for expression recognition. These methods often require rigorous hyperparameter tuning to achieve good results. Recently Deep Neural Networks (DNN) have shown to outperform traditional methods in visual object recognition. In this paper, we propose a two-par… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0
2

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 74 publications
(35 citation statements)
references
References 38 publications
0
33
0
2
Order By: Relevance
“…While Inception and ResNet have shown remarkable results in FER [20,52], these methods do not extract the temporal relations of the input data. Therefore, we propose a 3D Inception-ResNet architecture to address this issue.…”
Section: Proposed Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…While Inception and ResNet have shown remarkable results in FER [20,52], these methods do not extract the temporal relations of the input data. Therefore, we propose a 3D Inception-ResNet architecture to address this issue.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Extracting these temporal relations has been studied using traditional methods in the past. Examples of these attempts are Hidden Markov Models [5,60,64] (which combine temporal information and apply segmentation on videos), Spatio-Temporal Hidden Markov Models (ST-HMM) by coupling S-HMM and T-HMM [50], Dynamic Bayesian Networks (DBN) [45,63] associated with a multi-sensory information fusion strategy, Bayesian temporal models [46] to capture the dynamic facial expression transition, and Conditional Random Fields (CRFs) [19,20,25,48] and their extensions such as Latent-Dynamic Conditional Random Fields (LD-CRFs) and Hidden Conditional Random Fields (HCRFs) [58].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Inception and ResNet have shown remarkable results in various tasks [14,21,30,34]. For the Affect-in-the-Wild challenge, we proposed Inception-ResNet based architectures followed by LSTM units (submission 3) for the task of affect estimation.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Face alignment is a traditional pre-processing step in many facerelated recognition tasks. We list some well-known approaches [16], [63] 3000 fps [64] 68 [55] Incremental [65] 49 [66] Deep learning cascaded CNN [67] 5 fast good/ very good [68] MTCNN [69] 5 [70], [71] and publicly available implementations that are widely used in deep FER. Given a series of training data, the first step is to detect the face and then to remove background and non-face areas.…”
Section: Face Alignmentmentioning
confidence: 99%