2016 Fourth International Conference on 3D Vision (3DV) 2016
DOI: 10.1109/3dv.2016.84
|View full text |Cite
|
Sign up to set email alerts
|

3D Human Pose Estimation via Deep Learning from 2D Annotations

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
30
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 48 publications
(30 citation statements)
references
References 28 publications
0
30
0
Order By: Relevance
“…Alternatively, the 3D pose can be predicted from a given set of 2D keypoints by simply predicting their depth [58]. Some works enforce priors about bone lengths and projection consistency with the 2D ground truth [2]. Video pose estimation.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Alternatively, the 3D pose can be predicted from a given set of 2D keypoints by simply predicting their depth [58]. Some works enforce priors about bone lengths and projection consistency with the 2D ground truth [2]. Video pose estimation.…”
Section: Related Workmentioning
confidence: 99%
“…For Mask R-CNN, we adopt a ResNet-101 backbone trained with the "stretched 1x" schedule [12]. 2 When finetuning the model on Human3.6M, we reinitialize the last layer of the keypoint network, as well as the deconv layers that regress the heatmaps to learn a new set of keypoints. We train on 4 GPUs with a step-wise decaying learning rate: 1e-3 for 60k iterations, then 1e-4 for 10k iterations, and 1e-5 for 10k iterations.…”
Section: Implementation Details For 2d Pose Estimationmentioning
confidence: 99%
“…Projection Layer ProjLayer: Recent work in 3D body pose estimation has integrated projection layers to leverage 2D-only annotated data for training 3D pose prediction [3]. Since our training dataset provides perfect 3D ground truth, we employ our projection layer merely as refinement module to link the 2D and 3D predictions.…”
Section: A2 Regnet Networkmentioning
confidence: 99%
“…Existing approaches use bone length and depth ordering constraints [Mori and Malik 2006;Taylor 2000], sparsity assumptions [Wang et al 2014;Zhou et al 2015,a], joint limits [Akhter and Black 2015], inter-penetration constraints [Bogo et al 2016], temporal dependencies [Rhodin et al 2016b], and regression [Yasin et al 2016]. Treating 3D pose as a hidden variable in 2D estimation is an alternative [Brau and Jiang 2016]. However, the sparse set of 2D locations loses image evidence, e.g.…”
Section: Multi-viewmentioning
confidence: 99%