2012
DOI: 10.1016/j.imavis.2012.06.005
|View full text |Cite
|
Sign up to set email alerts
|

Static and dynamic 3D facial expression recognition: A comprehensive survey

Abstract: a b s t r a c t a r t i c l e i n f oAutomatic facial expression recognition constitutes an active research field due to the latest advances in computing technology that make the user's experience a clear priority. The majority of work conducted in this area involves 2D imagery, despite the problems this presents due to inherent pose and illumination variations. In order to deal with these problems, 3D and 4D (dynamic 3D) recordings are increasingly used in expression analysis research. In this paper we survey… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
171
0
2

Year Published

2012
2012
2018
2018

Publication Types

Select...
5
2
2
1

Relationship

3
7

Authors

Journals

citations
Cited by 328 publications
(173 citation statements)
references
References 105 publications
(215 reference statements)
0
171
0
2
Order By: Relevance
“…Thus, p f p k is the output feature map with value ranging from 0 to 15, which is further transformed to the histogram for feature representation. For each face patch or region, the corresponding feature GSF is then vectorized as a 16 × n s × n a dimension vector, where n s , n a are defined in Equation (1). Finally, the feature of the i-th expression sample is represented as…”
Section: Feature Extractionmentioning
confidence: 99%
“…Thus, p f p k is the output feature map with value ranging from 0 to 15, which is further transformed to the histogram for feature representation. For each face patch or region, the corresponding feature GSF is then vectorized as a 16 × n s × n a dimension vector, where n s , n a are defined in Equation (1). Finally, the feature of the i-th expression sample is represented as…”
Section: Feature Extractionmentioning
confidence: 99%
“…The image-transform-based descriptors, i.e., PCA, DCT and DWT [36], are the most commonly used feature representations for visual speech processing tasks [50], [51]. LBP has been widely used as a robust image compression technique for texture representation [37], and it is one of the most commonly used facial appearance descriptors in face recognition and facial expression recognition [39], [40]. HOG was first applied for human detection in images [38], but has proved successful in a wide range of computer vision problems, like recently in lip activity analysis [52].…”
Section: ) Mouth Region Of Interest (Roi) Extractionmentioning
confidence: 99%
“…A model for emotion recognition using facial Points Localization Model has been developed in [4]. Sandbach made a comprehensive survey on the developments of 3D and 4D facial expression recognition, and reviewed the tracking and alignment methods [5]. A novel phase congruency based descriptor for dynamic facial expression analysis which is robust to image scale and illumination variations was introduced in [6].…”
Section: Introductionmentioning
confidence: 99%