2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.494
|View full text |Cite
|
Sign up to set email alerts
|

Hand Keypoint Detection in Single Images Using Multiview Bootstrapping

Abstract: We present an approach that uses a multi-camera system to train fine-grained detectors for keypoints that are prone to occlusion, such as the joints of a hand. We call this procedure multiview bootstrapping: first, an initial keypoint detector is used to produce noisy labels in multiple views of the hand. The noisy detections are then triangulated in 3D using multiview geometry or marked as outliers. Finally, the reprojected triangulations are used as new labeled training data to improve the detector. We repea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
748
1
4

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 1,004 publications
(754 citation statements)
references
References 28 publications
1
748
1
4
Order By: Relevance
“…where y t (p) is the true location andŷ t (p) ∈ Y is the predicted location of keypoint p Another metric, Probability of Correct Keypoints (PCK) [3,35], is also used to evaluate all models. If a predicted keypoint lies inside a sphere (of radius σ ) around the ground truth, the prediction is deemed correct.…”
Section: Objective Evaluation Metricsmentioning
confidence: 99%
“…where y t (p) is the true location andŷ t (p) ∈ Y is the predicted location of keypoint p Another metric, Probability of Correct Keypoints (PCK) [3,35], is also used to evaluate all models. If a predicted keypoint lies inside a sphere (of radius σ ) around the ground truth, the prediction is deemed correct.…”
Section: Objective Evaluation Metricsmentioning
confidence: 99%
“…When the face landmark tracking fails, our physical simulation will maintain a natural facial expression, but the system will not be able to match the subject's expression. We note that very recent facial landmarks trackers such as OpenPose [SJMS17] could alleviate this issue. The landmark tracker's results suffer from jittering issues which also affect the final animation.…”
Section: Limitations and Future Workmentioning
confidence: 99%
“…The body tracking algorithm uses a modified version of the Realtime Multi-Person Pose Estimation algorithm [10]. Faces and hands are tracked using OpenPose [1,10,11,12]. Using this software, we are able to compute facial landmark, head pose, eyes and facial Action Units.…”
Section: Storytellers' Informationmentioning
confidence: 99%
“…The emergence of RGB-D sensors (like the Kinect) and of Fab-Labs with their new widespread technologies (3D printing, laser cutting ...) lets people envision developing lighter and cheaper systems. In parallel, the increasing performance of human perception algorithms thanks to Deep Learning [1,10,11,12] tends to improve human analysis capabilities of interaction systems. Even if interaction systems take advantages of these recent progresses, there are still challenging problems to address.…”
Section: Introductionmentioning
confidence: 99%