2010
DOI: 10.1007/978-3-642-15552-9_29
|View full text |Cite
|
Sign up to set email alerts
|

Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification

Abstract: Abstract. Much recent research in human activity recognition has focused on the problem of recognizing simple repetitive (walking, running, waving) and punctual actions (sitting up, opening a door, hugging). However, many interesting human activities are characterized by a complex temporal composition of simple actions. Automatic recognition of such complex actions can benefit from a good understanding of the temporal structures. We present in this paper a framework for modeling motion by exploiting the tempor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
533
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 535 publications
(535 citation statements)
references
References 26 publications
2
533
0
Order By: Relevance
“…Moreover, some of the actions have to be learned from a very small number of training samples. To show that the approach works on other datasets, we also evaluated it on the Olympic Sports dataset [28]. The performance is 77.5% (without poselets) and 85.5% (with poselet motion features).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, some of the actions have to be learned from a very small number of training samples. To show that the approach works on other datasets, we also evaluated it on the Olympic Sports dataset [28]. The performance is 77.5% (without poselets) and 85.5% (with poselet motion features).…”
Section: Methodsmentioning
confidence: 99%
“…The performance is 77.5% (without poselets) and 85.5% (with poselet motion features). However, the Olympic Sports dataset [28] uses a very small test set (some actions are evaluated on only 3-4 video clips). This means that the overall score for Olympic Sports can be strongly affected based on just a few test examples.…”
Section: Methodsmentioning
confidence: 99%
“…Marszalek et al [13] report classification results of average precision 0.326 using a BoW framework and STIP features, clearly showing the difficulty of the problem. The same approach applied to action classification from YouTube videos of sport events shows that BoW approaches on real world data sets need further improvement [16]. Similarly, prior work on clustering features extracted from video sequences from the CMU Multimodal Database [4] shows that several algorithms cluster samples from the same subjects, rather than discriminate samples across action classes.…”
Section: Prior Workmentioning
confidence: 99%
“…One common approach is to use space-time features to model points of interest in video [15,6]. Several authors have supplemented these techniques by adding more information to these features [11,40,41,19,25,30]. However, this approach is only capable of classifying, rather than detecting, activities.…”
Section: Introductionmentioning
confidence: 99%