2014
DOI: 10.1007/978-3-319-04114-8_40
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
68
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
6
3
1

Relationship

1
9

Authors

Journals

citations
Cited by 123 publications
(68 citation statements)
references
References 13 publications
0
68
0
Order By: Relevance
“…Although the use of the skeleton needs pre-processing, skeleton modality remains more suited to fast recognition. 74.2 9.06 HON4D (Oreifej and Liu, 2013) 89.3 76.6 17.51 SNV (Yang and Tian, 2014) 94.27 76.65 271.3 Actionlet* (Wang et al, 2012a) 87.1 69.7 0.139 LARP (Vemulapalli et al, 2014) 96.00 88.1 10.51 KSC 90 Indeed, according to (Papadopoulos et al, 2014), skeleton extraction process takes around 45ms per frame. For an action of 30 frames (very reasonable length for an action), the necessary time to extract a skeleton sequence is equal to nearly 1,35 s.…”
Section: A Trade-off Between Computational Latency and Recognition Acmentioning
confidence: 99%
“…Although the use of the skeleton needs pre-processing, skeleton modality remains more suited to fast recognition. 74.2 9.06 HON4D (Oreifej and Liu, 2013) 89.3 76.6 17.51 SNV (Yang and Tian, 2014) 94.27 76.65 271.3 Actionlet* (Wang et al, 2012a) 87.1 69.7 0.139 LARP (Vemulapalli et al, 2014) 96.00 88.1 10.51 KSC 90 Indeed, according to (Papadopoulos et al, 2014), skeleton extraction process takes around 45ms per frame. For an action of 30 frames (very reasonable length for an action), the necessary time to extract a skeleton sequence is equal to nearly 1,35 s.…”
Section: A Trade-off Between Computational Latency and Recognition Acmentioning
confidence: 99%
“…Moreover, Xia et al [25] utilize histograms of 3D joint locations (HOJ3D) as a compact representation of human postures. The spherical angles between selected joints, along with the respective angular velocities, are calculated in [15].…”
Section: Previous Work On 3d Action Recognitionmentioning
confidence: 99%
“…Indeed, RGB-D (Red Green Blue Depth) cameras provide an additional modality known as depth maps. Furthermore, with the work of Shotton et al [1], it became feasible to extract relatively accurate skeletons from depth maps in real-time (around 45ms for skeleton extraction per frame according to [3]). Although motion capture systems provide more accurate skeletons, the RGB-D cameras remain an interesting alternative given their lower cost.…”
Section: Introductionmentioning
confidence: 99%