Procedings of the British Machine Vision Conference 2012 2012
DOI: 10.5244/c.26.18
|View full text |Cite
|
Sign up to set email alerts
|

Genetic Programming-Evolved Spatio-Temporal Descriptor for Human Action Recognition

Abstract: The potential value of human action recognition has led to it becoming one of the most active research subjects in computer vision. In this paper, we propose a novel method to automatically generate low-level spatio-temporal descriptors showing good performance, for high-level human-action recognition tasks. We address this as an optimization problem using genetic programming (GP), an evolutionary method, which produces the descriptor by combining a set of primitive 3D operators. As far as we are aware, this i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(13 citation statements)
references
References 34 publications
0
13
0
Order By: Relevance
“…The first group combines hand-crafted skeleton features and graphical models to recognize actions. The spatio-temporal representations from skeleton sequences are often modeled by several common probabilistic graphical models such as Hidden Markov Model (HMM) (Lv and Nevatia, 2006;Wang et al, 2012;Yang et al, 2013), Latent Dirichlet Allocation (LDA) (Blei et al, 2003;Liu et al, 2012) or Conditional Random Field (CRF) (Koppula and Saxena, 2013). In addition, Fourier Temporal Pyramid (FTP) (Wang et al, 2012;Vemulapalli et al, 2014;Hu et al, 2015) has also been used to capture the temporal dynamics of actions and then to predict their labels.…”
Section: Related Workmentioning
confidence: 99%
“…The first group combines hand-crafted skeleton features and graphical models to recognize actions. The spatio-temporal representations from skeleton sequences are often modeled by several common probabilistic graphical models such as Hidden Markov Model (HMM) (Lv and Nevatia, 2006;Wang et al, 2012;Yang et al, 2013), Latent Dirichlet Allocation (LDA) (Blei et al, 2003;Liu et al, 2012) or Conditional Random Field (CRF) (Koppula and Saxena, 2013). In addition, Fourier Temporal Pyramid (FTP) (Wang et al, 2012;Vemulapalli et al, 2014;Hu et al, 2015) has also been used to capture the temporal dynamics of actions and then to predict their labels.…”
Section: Related Workmentioning
confidence: 99%
“…In [ 16 ], an aggregation function (average value) is tested, which, in fact, outperforms other approaches based on decision-level fusion. Conversely, in [ 25 ], single-view features are successfully joined using a concatenation of vectors and, therefore, preserving all the characteristic data. More sophisticated techniques can also be found, as in [ 26 ], where canonical correlation analysis is employed.…”
Section: Related Workmentioning
confidence: 99%
“…In this paper, we use GP to automatically synthesize spatio-temporal descriptors from a set of 3D filters and operators for dynamic hand gesture recognition. A simplified version of our method has been applied to extract features for action recognition in [18].…”
Section: Related Workmentioning
confidence: 99%