2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01220
|View full text |Cite
|
Sign up to set email alerts
|

Point-To-Pose Voting Based Hand Pose Estimation Using Residual Permutation Equivariant Layer

Abstract: Recently, 3D input data based hand pose estimation methods have shown state-of-the-art performance, because 3D data capture more spatial information than the depth image. Whereas 3D voxel-based methods need a large amount of memory, PointNet based methods need tedious preprocessing steps such as K-nearest neighbour search for each point. In this paper, we present a novel deep learning hand pose estimation method for an unordered point cloud. Our method takes 1024 3D points as input and does not require additio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
72
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 82 publications
(72 citation statements)
references
References 33 publications
0
72
0
Order By: Relevance
“…They usually require a good initialization; otherwise they are susceptible to getting stuck in local minima. Discriminative methods learn a direct mapping from visual observations to hand poses [23,27,10,13,31,2]. Thanks to large-scale annotated datasets [31,29,23], deep learningbased discriminative methods have shown very strong performance in the hand pose estimation task.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…They usually require a good initialization; otherwise they are susceptible to getting stuck in local minima. Discriminative methods learn a direct mapping from visual observations to hand poses [23,27,10,13,31,2]. Thanks to large-scale annotated datasets [31,29,23], deep learningbased discriminative methods have shown very strong performance in the hand pose estimation task.…”
Section: Related Workmentioning
confidence: 99%
“…Moon et al [11] use 3D voxels as input and regress the hand pose with a 3D CNN. More recent works [10,5] apply 3D point clouds as input and can estimate very accurate hand poses.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations