2019 International Conference on Multimodal Interaction 2019
DOI: 10.1145/3340555.3355715

Group-level Cohesion Prediction using Deep Learning Models with A Multi-stream Hybrid Network

Cited by 5 publications (7 citation statements)
References 20 publications
“…This network uses three cascaded convolutional networks to improve face detection accuracy, which makes its use very common [ 18 , 23 , 24 , 25 , 26 , 27 , 28 , 29 ]. There are other methods that also use neural networks, such as RetinaFace [ 21 , 30 ], PyramidBox [ 19 ], TinyFace [ 19 , 20 , 31 ], and the Single-Shot Scale-Invariant Face Detector (S3FD) [ 32 ]. Other methods do not use neural networks for face detection, such as the Viola–Jones algorithm [ 33 ], which uses Haar characteristics to locate the face in an image.…”
Section: Related Work (mentioning)
confidence: 99%
“…In [ 26 ], two ResNet models were used: ResNet-18 for small faces (size less than 48 × 48) and ResNet-34 for large faces (size larger than 48 × 48). To improve the precision in the detection of individual emotions, apart from using a Dense Convolutional Network (DenseNet201), two neural networks (Inception-ResNet-v2) can be combined, as in [ 19 ], or new blocks (e.g., excitement and comprehension blocks) can be added to a neural network, as in [ 18 , 25 ].…”
Section: Related Work (mentioning)
confidence: 99%
“…It remains an open question whether a group level mimicry feature would further improve performances. For the occasion of the EmotiW 2019 challenge [10], some studies [13,16,50] classified high and low levels of cohesion on images from a corpus of images created via web crawling of various keywords related to social events [11]. They showed how facial expressions were impacting external annotations and achieved promising results at predicting cohesion from images.…”
Section: Related Work 2.1 Automated Approaches To Detect Cohesion (mentioning)
confidence: 99%
“…Gestures are often expressed by quantifying the total movement of the hands over time. Hand positions are most often detected either by using additional sensors [32,51], or by first deriving skeleton data [16,50], using software solutions such as OpenPose [7]. Body language cues can contain additional information on the intensity and synchronicity of the conversation in a group.…”
Section: Nonverbal Features From the Visual Channel Related To Cohesion (mentioning)
confidence: 99%