Fusing appearance and distribution information of interest points for action recognition

Bregonzio, Matteo; Xiang, Tao; Gong, Shaogang

doi:10.1016/j.patcog.2011.08.014

Cited by 75 publications

(67 citation statements)

References 4 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The action is represented by a complementary combination of local and global features, which are BoVW representation based on interest point descriptors [6] and shape-flow descriptors [14], respectively. Our low-level features are BoVW representation based on interest point descriptors [7]. [6] uses local information within a small region and tends to generate spurious detection in background areas, which affects the performance of the low-level action descriptors of [4], [12] and [13].…”

Section: Resultsmentioning

confidence: 99%

See 1 more Smart Citation

Topic-Based Knowledge Transfer Algorithm for Cross-View Action Recognition

Chen

Yang

Gan

2014

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

SUMMARYCross-view action recognition is a challenging research field for human motion analysis. Appearance-based features are not credible if the viewpoint changes. In this paper, a new framework is proposed for cross-view action recognition by topic based knowledge transfer. First, Spatio-temporal descriptors are extracted from the action videos and each video is modeled by a bag of visual words (BoVW) based on the codebook constructed by the k-means cluster algorithm. Second, Latent Dirichlet Allocation (LDA) is employed to assign topics for the BoVW representation. The topic distribution of visual words (ToVW) is normalized and taken to be the feature vector. Third, in order to bridge different views, we transform ToVW into bilingual ToVW by constructing bilingual dictionaries, which guarantee that the same action has the same representation from different views. We demonstrate the effectiveness of the proposed algorithm on the IXMAS multi-view dataset. key words: cross-view human action recognition, topic distribution of visual words, latent dirichlet allocation, bilingual dictionary

show abstract

Section: Resultsmentioning

confidence: 99%

“…However, it uses local information within a small region and tends to generate spurious detection in background areas. In this paper, we adopt the algorithm proposed by [7], which overcomes the shortcomings of the Dollar detector.…”

Section: Feature Extractionmentioning

confidence: 99%

Topic-Based Knowledge Transfer Algorithm for Cross-View Action Recognition

Chen

Yang

Gan

2014

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

show abstract

“…Accuracy Schuldt et al [9] 71.72% Dollar et al [11] 81.2% Reddy, Shah [22] 89.79% Fathi and Mori [37] 90.5% Klaser et al [18] 91.4% Marszalek et al [12] 91.8% egonzio et al [19] 94.33% Kovashka et al [27] 94.5% Gilbert et al [14] 94.5% RMD 91.2% RMD + Outlier Detection 94% RMD + Mode Finding 92.1% Table 2: Average accuracies on the KTH dataset using the Training/Validation/Test split defined in [9].…”

Section: Methodsmentioning

confidence: 99%

“…These include local jet descriptors [9], vector of concatenated pixel gradients [11], generalisation of the SIFT and SURF descriptors [17,18,16], and the HOG/HOF descriptors [12]. The approach by Bregonzio et al [19] differs significantly from the existing interest point based representation in that only the global distribution information of interest points is exploited. The detected interest points are typically then used in a discriminative [9,11] or generative [10] model.…”

Section: Related Workmentioning

confidence: 99%

Capturing relative motion and finding modes for action recognition in the wild

Oshin¹,

Gilbert²,

Bowden³

2014

Computer Vision and Image Understanding

View full text Add to dashboard Cite

“…All the feature vectors produced by different approaches are concatenated to form a larger feature vector. Liu et al [13], Ye et al [14] and Bregonzio et al [15] employ the kernel-level fusion approach and utilize a multi-kernel classifier for combining different features. The above fusion approaches rely only on the pairwise similarities of videos without considering the highorder correlations among videos.…”

Section: Related Workmentioning

confidence: 99%

3D R Transform on Spatio-temporal Interest Points for Action Recognition

Yuan

et al. 2013

2013 IEEE Conference on Computer Vision and Pattern Recognition

View full text Add to dashboard Cite

Fusing appearance and distribution information of interest points for action recognition

Cited by 75 publications

References 4 publications

Topic-Based Knowledge Transfer Algorithm for Cross-View Action Recognition

Topic-Based Knowledge Transfer Algorithm for Cross-View Action Recognition

Capturing relative motion and finding modes for action recognition in the wild

3D R Transform on Spatio-temporal Interest Points for Action Recognition

Contact Info

Product

Resources

About