CVPR 2011 2011
DOI: 10.1109/cvpr.2011.5995496
|View full text |Cite
|
Sign up to set email alerts
|

Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
637
2
3

Year Published

2012
2012
2017
2017

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 783 publications
(647 citation statements)
references
References 28 publications
5
637
2
3
Order By: Relevance
“…It is approximately 7% better than the current best published results. Our method obtains better results compared to [3], [11]. Our method also gives better results compared to our baseline method since our camera position features capture the camera position variation for this complex and challenging dataset.…”
Section: Resultsmentioning
confidence: 75%
See 1 more Smart Citation
“…It is approximately 7% better than the current best published results. Our method obtains better results compared to [3], [11]. Our method also gives better results compared to our baseline method since our camera position features capture the camera position variation for this complex and challenging dataset.…”
Section: Resultsmentioning
confidence: 75%
“…Sparse local spatio-temporal (ST) features have shown advantages in human action recognition [1], [2]. Besides these hand-designed features, hierarchical invariant ST features [3] that are learned from data directly are also proposed for human action recognition. They achieve impressive performance on many realistic video datasets.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, there has been a growing interest in learning visual features using biologically-inspired networks, such as, Independent Component Analysis (ICA) [11] and Independent Subspace Analysis [12]. In [13], Le shows that using their learned 3D (Spatiotemporal) filters by ISA, the action recognition performance is comparable to other hand- Fig. 2: Framework of the proposed method.…”
Section: Related Workmentioning
confidence: 62%
“…Our autoencoder model draws inspiration from Independent Subspace Analysis (ISA) [25] which was proposed for learning motion invariance. Unlike conventional sparse autoencoders which enforce sparsity in the hidden layer, the proposed autoencoder performs subspace pooling on the hidden layer activations and enforces sparsity in the pooling layer, in a way identical to ISA.…”
Section: Autoencoder With Temporal Slowness Constraintmentioning
confidence: 99%