2014 IEEE Conference on Computer Vision and Pattern Recognition 2014
DOI: 10.1109/cvpr.2014.102
|View full text |Cite
|
Sign up to set email alerts
|

Multiple Granularity Analysis for Fine-Grained Action Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
38
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 54 publications
(38 citation statements)
references
References 20 publications
0
38
0
Order By: Relevance
“…Early work in activity recognition involved using hidden Markov models to learn latent action states [58], followed by discriminative SVM models that used key poses and action grammars [31,48,35]. Similar works have used hand-crafted features [40] or object-centric features [30] to recognize actions in fixed camera settings. More recent works have used dense trajectories [51] or deep learning features [19] to study actions.…”
Section: Related Workmentioning
confidence: 99%
“…Early work in activity recognition involved using hidden Markov models to learn latent action states [58], followed by discriminative SVM models that used key poses and action grammars [31,48,35]. Similar works have used hand-crafted features [40] or object-centric features [30] to recognize actions in fixed camera settings. More recent works have used dense trajectories [51] or deep learning features [19] to study actions.…”
Section: Related Workmentioning
confidence: 99%
“…Most existing work on temporal action segmentation focuses on a fully supervised task [15,26,31,19,32,22,6,37,18,36,28]. Purely CNN based approaches such as structured segment networks [37] or temporal convolutional networks [18] have recently shown convincing results on several action segmentation benchmarks.…”
Section: Related Workmentioning
confidence: 99%
“…In [10,22], human actions are modeled as space-time structures, using deformable part models [2]. In [15,18] discriminative handcentric features are explored for fine grained activity detection in cooking, i.e., relatively short sub-activities such as chop and fill. In [3], the detector is trained on CNN features extracted from the action tubes in space-time; however, evaluation is on relatively short video clips (i.e., several hundred frames) of relatively short actions.…”
Section: Related Workmentioning
confidence: 99%