Proceedings of the 20th ACM International Conference on Multimodal Interaction 2018
DOI: 10.1145/3242969.3264991
|View full text |Cite
|
Sign up to set email alerts
|

Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 38 publications
(27 citation statements)
references
References 13 publications
0
27
0
Order By: Relevance
“…Both use classic descriptors for faces as well as upper bodies. In [80], CNNs are used for analysis of faces, scenes, and bodies, and [81] and adds a skeleton analysis to the face and scene analysis, all done with CNNs. Faces, scenes and skeletons are also analysed with CNNs in [82], where on the face-level the CNN output is fed to an LSTM, and where on the scene-level an attention mask is placed over the image.…”
Section: Hybrid Approachesmentioning
confidence: 99%
See 3 more Smart Citations
“…Both use classic descriptors for faces as well as upper bodies. In [80], CNNs are used for analysis of faces, scenes, and bodies, and [81] and adds a skeleton analysis to the face and scene analysis, all done with CNNs. Faces, scenes and skeletons are also analysed with CNNs in [82], where on the face-level the CNN output is fed to an LSTM, and where on the scene-level an attention mask is placed over the image.…”
Section: Hybrid Approachesmentioning
confidence: 99%
“…Faces, scenes, and upper bodies [34], [35] Faces, scenes, and bodies/skeletons [80], [81], [82] Faces, scenes, skeletons, [2], [42] and visual attentions/objects Faces and objects [83] Faces, scenes, and places [24] and scene analysis), or fusion of individual emotions in a bottom-up approach.…”
Section: Aspects Description Studiesmentioning
confidence: 99%
See 2 more Smart Citations
“…Traditionally, the analysis of the affective states of people has been done using individual data: meaning that only one person is present in the data stream. In the recent past, various papers on group emotion recognition, meaning the ability of extracting both grouped and individual affective states from still images has gained some popularity [29,30,17,31,33]. However, the recognition throught time of the affective state of individuals involved in group interactions has not yet been thoroughly addressed.…”
Section: Introductionmentioning
confidence: 99%