2020
DOI: 10.1109/access.2020.3001637
|View full text |Cite
|
Sign up to set email alerts
|

Fast and Accurate 3D Hand Pose Estimation via Recurrent Neural Network for Capturing Hand Articulations

Abstract: 3D hand pose estimation from a single depth image plays an important role in computer vision and human-computer interaction. Although recent hand pose estimation methods using convolution neural network (CNN) have shown notable improvements in accuracy, most of them have a limitation that they rely on a complex network structure without fully exploiting the articulated structure of the hand. A hand, which is an articulated object, is composed of six local parts: the palm and five independent fingers. Each fing… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 31 publications
0
6
0
Order By: Relevance
“…The performance of the proposed method was compared with previous state-of-the-art methods using a 2D or 3D depth image input. The 3D input models are a 3DCNN [16], SHPR-Net [29], HandPointNet [30], Point-to-Point [31], and V2V-PoseNet [14], and the 2D input models are a DeepModel [19], Deep-Prior [12], Deep-Prior++ [13], Feedback [32], REN-4x6x6 [25], REN-9x6x6 [33], Pose-REN [17], CrossInfoNet [28], HCRNN [34], and A2J [35].…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…The performance of the proposed method was compared with previous state-of-the-art methods using a 2D or 3D depth image input. The 3D input models are a 3DCNN [16], SHPR-Net [29], HandPointNet [30], Point-to-Point [31], and V2V-PoseNet [14], and the 2D input models are a DeepModel [19], Deep-Prior [12], Deep-Prior++ [13], Feedback [32], REN-4x6x6 [25], REN-9x6x6 [33], Pose-REN [17], CrossInfoNet [28], HCRNN [34], and A2J [35].…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…Using these synthetic datasets, we can also compare the spectrum of poses covered by the currently available datasets and hence cover a broader spectrum of poses for training. Analyzing the history of the hands' motion using methods such as recurrent neural networks (Yoo et al, 2020 ) instead of processing only one instance of the hand can avoid erratic motions during self-occlusions and will be investigated in another study for adding the feature to the SSC-CNN. The history can include the velocity and acceleration of the joint motions, which also have biomechanical bounds and further enhance the pose realism during hand motion tracking.…”
Section: Conclusion Limitations and Future Workmentioning
confidence: 99%
“…More recent approaches to hand pose estimation from a single depth map decomposes the task into several sub-tasks (for palm and fingers) [49], [50]. In [49], the authors adopt two-branch cross-connection structure to share the beneficial complementary information between the sub-tasks.…”
Section: Related Workmentioning
confidence: 99%