Proceedings of the 19th ACM International Conference on Multimodal Interaction 2017
DOI: 10.1145/3136755.3143007
|View full text |Cite
|
Sign up to set email alerts
|

Group-level emotion recognition using transfer learning from face identification

Abstract: In this paper we describe our algorithmic approach, which was used for submissions in the fifth Emotion Recognition in the Wild (EmotiW 2017) group-level emotion recognition sub-challenge. We extracted feature vectors of detected faces using the Convolutional Neural Network trained for face identification task, rather than traditional pre-training on emotion recognition problems. In the final pipeline an ensemble of Random Forest classifiers was learned to predict emotion score using available training set. In… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
37
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 59 publications
(38 citation statements)
references
References 30 publications
0
37
0
1
Order By: Relevance
“…Recognition rate baseline [11] 52.97 VGG-face [55] 65.41 VGG-16 [55] 64.11 Resnet-50 [55] 62.65 Xception [55] 60.18 Facial emotion CNN [25] 69.97 VGG-19 scene [25] 67.2 Face-pretrained CNN [23] 60 InceptionV3-FC [23] 63.19 VGG16-FC [23] 66.30 Fusion of Scene and VGG-face [22] 65.0 Ensemble of classifiers [55] 66.51 Face-pretrained CNN + InceptionV3-FC [23] 70.09 Face-pretrained CNN + VGG16-FC [23] 72.38 SVM-GAK (RVLBP) 67.32 SVM-GAK (CNN) 70.67 SVM-CGAK 72.17…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…Recognition rate baseline [11] 52.97 VGG-face [55] 65.41 VGG-16 [55] 64.11 Resnet-50 [55] 62.65 Xception [55] 60.18 Facial emotion CNN [25] 69.97 VGG-19 scene [25] 67.2 Face-pretrained CNN [23] 60 InceptionV3-FC [23] 63.19 VGG16-FC [23] 66.30 Fusion of Scene and VGG-face [22] 65.0 Ensemble of classifiers [55] 66.51 Face-pretrained CNN + InceptionV3-FC [23] 70.09 Face-pretrained CNN + VGG16-FC [23] 72.38 SVM-GAK (RVLBP) 67.32 SVM-GAK (CNN) 70.67 SVM-CGAK 72.17…”
Section: Methodsmentioning
confidence: 99%
“…Comparing with multi-modal approaches listed in the lines 11-14, SVM-GAK can as well obtain the promising results. SVM-CGAK achieved higher recognition rate (72.17%) than Fusion of Scene and VGG-face (65.0%) [22], Face-pretrained CNN + InceptionV3-FC (70.09%) [55] and Ensemble of classifier (66.51%) [55], while SVM-CGAK obtained a little worse performance comparing with hybrid network (72.38%) [55]. Different from the hybrid network [55], they combined face and scene information.…”
Section: Group-level Facial Expression Recognitionmentioning
confidence: 96%
See 3 more Smart Citations