2019
DOI: 10.1007/978-3-030-11024-6_18
|View full text |Cite
|
Sign up to set email alerts
|

Spatial-Temporal Attention Res-TCN for Skeleton-Based Dynamic Hand Gesture Recognition

Abstract: Dynamic hand gesture recognition is a crucial yet challenging task in computer vision. The key of this task lies in an effective extraction of discriminative spatial and temporal features to model the evolutions of different gestures. In this paper, we propose an end-to-end Spatial-Temporal Attention Residual Temporal Convolutional Network (STA-Res-TCN) for skeleton-based dynamic hand gesture recognition, which learns different levels of attention and assigns them to each spatialtemporal feature extracted by t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
63
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 104 publications
(63 citation statements)
references
References 26 publications
0
63
0
Order By: Relevance
“…DSTM [14] is excluded from this experiment for not giving the reference data and experiment result. HG-GCN improves 0.3% accuracy over STA-Res-TCN [12] for the complicated 28 gestures setting, which shows the advantage of the proposed method on recognizing fine gestures. As for the comparison with other methods, HG-GCN shows its advantages obviously.…”
Section: Resultsmentioning
confidence: 82%
See 2 more Smart Citations
“…DSTM [14] is excluded from this experiment for not giving the reference data and experiment result. HG-GCN improves 0.3% accuracy over STA-Res-TCN [12] for the complicated 28 gestures setting, which shows the advantage of the proposed method on recognizing fine gestures. As for the comparison with other methods, HG-GCN shows its advantages obviously.…”
Section: Resultsmentioning
confidence: 82%
“…3a: four joints for each finger, one joint (1) for the palm, and one joint (0) for the wrist. The thumb contains joints (5, 4, 3, 2), the index finger contains joints (9, 8, 7, 6), the middle finger contains joints (13,12,11,10), the ring finger contains joints (17,16,15,14), and the pinkie contains joints (21,20,19,18). Yan et al linked the 18 joints of the human body skeleton with 17 edges from head to foot in order.…”
Section: Hand Gesture Graph Convnetmentioning
confidence: 99%
See 1 more Smart Citation
“…For skeleton-based action recognition, two types of input features are commonly used: the geometric feature [18], [22] and the Cartesian coordinate feature [31], [32], [34], [6], [7]. The Cartesian coordinate feature is variant to locations and viewpoints.…”
Section: A Modeling Location-viewpoint Invariant Feature By Joint Comentioning
confidence: 99%
“…However, these handcrafted features cannot model the spatial and temporal information effectively. Recently, several deep learning-based works [4,15,16] are proposed, which feed hand skeleton sequence into LSTM or CNN networks for hand gesture recognition.…”
Section: Introductionmentioning
confidence: 99%