Procedings of the British Machine Vision Conference 2008 2008
DOI: 10.5244/c.22.109
|View full text |Cite
|
Sign up to set email alerts
|

Information Theoretic Key Frame Selection for Action Recognition

Abstract: This paper presents an approach for human action recognition by finding the discriminative key frames from a video sequence and representing them with the distribution of local motion features and their spatiotemporal arrangements. In this approach, the key frames of the video sequence are selected by their discriminative power and represented by the local motion features detected in them and integrated from their temporal neighbors. In the key frame's representation, the spatial arrangements of the motion fea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
31
0
1

Year Published

2009
2009
2022
2022

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 52 publications
(33 citation statements)
references
References 23 publications
1
31
0
1
Order By: Relevance
“…Table 1 also compares our results with the existing approaches proposed recently, which are not restricted to interest points based methods. It shows that our results are close to the best result reported so far on each dataset, and outperforms most METHOD KTH WEZIMANN Our approach 93.17% 96.66% Fathi et al [5] 90.5% 100% Zhang et al [22] 91.33% 92.89% Kläser et al [9] 91.4% 84.3% Niebles et al [11] 83.3% 90.0% Dollar et al [3] 81.17% 85.2% Liu et al [10] 94.16% -Zhao et al [23] 91.17% -Gilbert et al [6] 89.92% -Savarese et al [17] 86.83% -Nowozin et al [12] 84.72% - recently proposed methods, especially those tested on both datasets.…”
Section: Recognition Ratesupporting
confidence: 55%
“…Table 1 also compares our results with the existing approaches proposed recently, which are not restricted to interest points based methods. It shows that our results are close to the best result reported so far on each dataset, and outperforms most METHOD KTH WEZIMANN Our approach 93.17% 96.66% Fathi et al [5] 90.5% 100% Zhang et al [22] 91.33% 92.89% Kläser et al [9] 91.4% 84.3% Niebles et al [11] 83.3% 90.0% Dollar et al [3] 81.17% 85.2% Liu et al [10] 94.16% -Zhao et al [23] 91.17% -Gilbert et al [6] 89.92% -Savarese et al [17] 86.83% -Nowozin et al [12] 84.72% - recently proposed methods, especially those tested on both datasets.…”
Section: Recognition Ratesupporting
confidence: 55%
“…Keyframes: A number of approaches have proposed the use of keyframes as a representation. Carlson et al [5] use a single, manually selected, keyframe and shape matching to classify tennis strokes; [39] rank all frames based on holistic information theoretic measure to select the top 25% for classification using voting; [20] rely on spatio-temporal localization as pre-processing and use AdaBoost to select keyframes (making up from 13% to 20% of the sequence length). Unlike [5], we automatically select keyframes and return, not require as [20], spatial-temporal localization; our keyframe representation is also more compact utilizing fewer (up to 4, or 4% of the sequence) keyframes.…”
Section: Sequential Modelsmentioning
confidence: 99%
“…To obtain visually distinct representations, Cooper and Foote [30] presented methods for key frame selection based on capturing the similarity to the represented segment and preserving the differences from other segments' key frames, so that different segments will have visually distinct representations. Zhao and Elgammal [31] developed an effective approach for action classification, in which they first described all the poses with the distribution of local motion features and their spatiotemporal arrangements and then selected a small set of most discriminative poses by comparing their discriminative power for each independent action. Zhuang et al [32] applied an unsupervised clustering method for key-pose selection.…”
Section: B Contributionsmentioning
confidence: 99%