2011
DOI: 10.1007/978-3-642-19309-5_51
|View full text |Cite
|
Sign up to set email alerts
|

Sparse Coding on Local Spatial-Temporal Volumes for Human Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
62
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(62 citation statements)
references
References 22 publications
0
62
0
Order By: Relevance
“…Aiming to alleviate the quantization errors in the BoW model, sparse coding has also been introduced to action recognition to learn more compact and richer representations of human actions [48,49,12].…”
Section: Sparse Codingmentioning
confidence: 99%
See 2 more Smart Citations
“…Aiming to alleviate the quantization errors in the BoW model, sparse coding has also been introduced to action recognition to learn more compact and richer representations of human actions [48,49,12].…”
Section: Sparse Codingmentioning
confidence: 99%
“…In order to obtain a more accurate and discriminative representation, an approach by encoding local 3D spatial-temporal gradient features was proposed by Zhu et al [49] in which the sparse coding framework is used for the final action representation. A local spatial-temporal feature is transformed to a linear combination of a few atoms in a trained dictionary.…”
Section: Sparse Codingmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, many researchers have given attention to dictionary learning [4]- [6]. However, traditional dictionary learning usually neglects the similarity among the coding coefficients and have poor performances in dealing with non-linearly separable data.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, it has become more and more popular to employ sparse representation-based methods for various computer vision tasks, such as image classification [1], face recognition [2], [3], and human action recognition [4]- [8]. Sparse representation-based methods for human action recognition first compute the sparse feature representation with learned dictionary and then pool over the entire video to form the final representation, where average pooling and max pooling are usually used.…”
Section: Introductionmentioning
confidence: 99%