2018
DOI: 10.1007/978-3-030-11027-7_19
|View full text |Cite
|
Sign up to set email alerts
|

Emotion Recognition of a Group of People in Video Analytics Using Deep Off-the-Shelf Image Embeddings

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 12 publications
0
6
0
Order By: Relevance
“…For example, experiments from paper [ 17 ] clearly demonstrate that usage of TinyFaces detector provide lower accuracy of group emotion recognition when compared to Viola-Jones detector, if VGGFace features are used [ 15 ]. In such case, it is possible that we should use other facial descriptors trained on facial images with small resolution [ 1 ].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, experiments from paper [ 17 ] clearly demonstrate that usage of TinyFaces detector provide lower accuracy of group emotion recognition when compared to Viola-Jones detector, if VGGFace features are used [ 15 ]. In such case, it is possible that we should use other facial descriptors trained on facial images with small resolution [ 1 ].…”
Section: Discussionmentioning
confidence: 99%
“…Prediction of group-level emotion [ 1 ] and cohesiveness is very useful for various companies in order to analyze employee’s emotional state throughout the day and build a relationship between their emotional state and group cohesion [ 2 ]. As the cohesiveness of a group is a crucial indicator of success of a group of people, the problem of predicting the perceived cohesiveness of a group of people in image becomes one of the main tasks in the EmotiW (Emotion Recognition in the Wild) 2019 challenge [ 3 ].…”
Section: Introductionmentioning
confidence: 99%
“…are either a combination of non-neural classifiers [63] or a combination of a Long Short-Term Memory (LSTM) and a dense layer [64]. In [65], CNNs are employed for emotion prediction for individual faces, fusing the individual predictions to form a group prediction.…”
Section: Images and Videosmentioning
confidence: 99%
“…In these studies, features are in some way combined before being fed to a classifier. Liu et al [75] use a simple average of face features, and [63] also average individual embeddings to get one image feature. In [21] and [22], group arousal is calculated from the ratio of responding individuals to all individuals.…”
Section: Feature-level Fusionmentioning
confidence: 99%
See 1 more Smart Citation