2017
DOI: 10.1007/s11042-017-5017-y
|View full text |Cite
|
Sign up to set email alerts
|

Supervised spatio-temporal kernel descriptor for human action recognition from RGB-depth videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 46 publications
0
7
0
Order By: Relevance
“…If there is one of the K Gaussian distributions that satisfies the condition of formula (15), it is determined that the pixel is matched with it, and the mean, variance, and weight of the Gaussian distribution are updated. The update process is shown in the formula: The transition matrix of HMM 7 Wireless Communications and Mobile Computing…”
Section: Background Extraction Methodmentioning
confidence: 99%
See 1 more Smart Citation
“…If there is one of the K Gaussian distributions that satisfies the condition of formula (15), it is determined that the pixel is matched with it, and the mean, variance, and weight of the Gaussian distribution are updated. The update process is shown in the formula: The transition matrix of HMM 7 Wireless Communications and Mobile Computing…”
Section: Background Extraction Methodmentioning
confidence: 99%
“…Therefore, dance movement recognition in reality usually relies on manual processing. The main part in the dance video sequence is manually segmented, cropped into multiple small videos, and then identified in sequence [15].…”
Section: Classification Of Action Recognition Technologymentioning
confidence: 99%
“…Classification accuracy (%) DCSF [54] 89.3 HON4D [15] 88.9 Super Normal Vector [16] 93.1 Skeletons Lie group [17] 89.5 DMM-LBP-DF [55] 93.0 2D-CNN on DMM-Pyramid [44] 91.1 3D-CNN on DMM-Cube [44] 86.1 HOG3D + LLC [56] 90.9 Hierarchical 3D Kernel [57] 92.7 GLAC on DMM [13] 89.4 DMM-GLAC-STACOG [13] 94.8 3DHoT + MBC [58] 95.2 Subspace encoding [59] 94.06 LSTM + trust gates [60] 94.8 Extended SNV [61] 93.5 Trust Gates [62] 94.8 ST-NBNN [63] 94.8 SSTKDes [64] 95.6 3D-CNN + DHI + relief + SVM [65] 92.8 WDMM + HOG [66] 91.9 WDMM + LBP [66] 91.6 WDMM + CNN [66] 90.0 Deep activations [67] 92.3 Deep activations + attributes [67] 93.4 Hierarchical Gaussian [68] 95.6 GMHI + GSHI + CRC [69] 94.5 MHF + SHF + KELM [36] 95.97 Spatiotemporal + HMM [70] 92. Experimental evaluation of our approach on UTD-MHAD dataset is represented by Table 5.…”
Section: Approachmentioning
confidence: 99%
“…Some of these works are IJIUS 9,1 discussed in this section. The holistic feature, STIP is considered as a base feature for the dictionary (Killedar and Sasi, 2014;Roy et al, 2016;Asadi-Aghbolaghi and Kasaei, 2018;Nazir et al, 2018;Uddin and Lee, 2019). The sparse coding (sparse matrix construction) is performed using the dictionary and the input signal (STIP value of the video frames).…”
Section: Related Workmentioning
confidence: 99%