2017 IEEE Winter Conference on Applications of Computer Vision (WACV) 2017
DOI: 10.1109/wacv.2017.31
|View full text |Cite
|
Sign up to set email alerts
|

Recognition of Group Activities in Videos Based on Single-and Two-Person Descriptors

Stephane Lathuiliere,
Georgios Evangelidis,
Radu Horaud

Abstract: Group activity recognition from videos is a very challenging problem that has barely been addressed. We propose an activity recognition method using group context. In order to encode both single-person description and two-person interactions, we learn mappings from highdimensional feature spaces to low-dimensional dictionaries. In particular the proposed two-person descriptor takes into account geometric characteristics of the relative pose and motion between the two persons. Both single-person and two-person … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
3
0

Year Published

2018
2018
2019
2019

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 37 publications
1
3
0
Order By: Relevance
“…Furthermore, our approac presents a homogeneous error distribution among the classes, as indicated by the lowest standard deviation. Our overall accuracy was similar to (Sun et al, 2016) and (Amer et al, 2014), and better than recent approaches such as SSVM (Lathuilière et al, 2017) and RMIC (Wang et al, 2017). It should be noticed that the best overall method DCGF+GRU (Kim et al, 2018) was trained and evaluated with augmented data, so that a direct comparison of the results might be biased.…”
Section: Collective Behavior Recognitionsupporting
confidence: 57%
See 1 more Smart Citation
“…Furthermore, our approac presents a homogeneous error distribution among the classes, as indicated by the lowest standard deviation. Our overall accuracy was similar to (Sun et al, 2016) and (Amer et al, 2014), and better than recent approaches such as SSVM (Lathuilière et al, 2017) and RMIC (Wang et al, 2017). It should be noticed that the best overall method DCGF+GRU (Kim et al, 2018) was trained and evaluated with augmented data, so that a direct comparison of the results might be biased.…”
Section: Collective Behavior Recognitionsupporting
confidence: 57%
“…With the objective of detecting groups and their characteristics in a crowd scenario, Shao et al (2016) define priors that aim to add temporal smoothness and consistency for collective transitionsgroups are obtained by searching for pedestrians sets that fit well these priors. Lathuilière et al (2017) presented an approach for group activity recognition using single and two-person descriptors using Structured Support Vector Machines (SSVMs). For that purpose, they use a dictionary-based method, exploring geometric characteristics of the relative pose and motion between the two persons.…”
Section: Introductionmentioning
confidence: 99%
“…Yin et al [162] employ 3D-SIFT to describe local motion events, but used a HOF to model the global motion in an image. Similarly, Lathuilière et al [73] combined HOG descriptors and trajectory information from linked local features. Single-person and two-person interaction attributes such as "two persons are standing side-by-side" were calculated from these features.…”
Section: Template Based Approachesmentioning
confidence: 99%
“…Yin et al [303] employ 3D-SIFT to describe local motion events, but used a HOF to model the global motion of the video sequence. Similarly, Lathuiliere et al [138] combined HOG descriptors and trajectory information from linked local features. Single-person and two-person interaction attributes, such as "two persons are standing side-by-side", were calculated from these features.…”
Section: Template-based Approachesmentioning
confidence: 99%