2019
DOI: 10.1109/tpami.2019.2892452
|View full text |Cite
|
Sign up to set email alerts
|

3D Human Pose Machines with Self-supervised Learning

Abstract: Driven by recent computer vision and robotic applications, recovering 3D human poses has become increasingly important and attracted growing interests. In fact, completing this task is quite challenging due to the diverse appearances, viewpoints, occlusions and inherently geometric ambiguities inside monocular images. Most of the existing methods focus on designing some elaborate priors /constraints to directly regress 3D human poses based on the corresponding 2D human pose-aware features or 2D pose prediction… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
41
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 69 publications
(41 citation statements)
references
References 54 publications
0
41
0
Order By: Relevance
“…In order to train the network without explicit 3D pose annotations, the prior of camera projection geometry was commonly explored, and some geometry-driven methods were proposed. Among them, re-projection loss is one of the most widely used technique (Kanazawa et al 2018;Wu et al 2016;Wandt and Rosenhahn 2019;Pavllo et al 2019;Wang et al 2019;Brau and Jiang 2016). However, using re-projection loss alone cannot accurately con-Figure 2: The overall architecture of our method that follows a two-stage pipeline.…”
Section: Weakly/self-supervised Approachesmentioning
confidence: 99%
“…In order to train the network without explicit 3D pose annotations, the prior of camera projection geometry was commonly explored, and some geometry-driven methods were proposed. Among them, re-projection loss is one of the most widely used technique (Kanazawa et al 2018;Wu et al 2016;Wandt and Rosenhahn 2019;Pavllo et al 2019;Wang et al 2019;Brau and Jiang 2016). However, using re-projection loss alone cannot accurately con-Figure 2: The overall architecture of our method that follows a two-stage pipeline.…”
Section: Weakly/self-supervised Approachesmentioning
confidence: 99%
“…Self-supervised learning, more in line with the law of biological learning, has been very successful at Natural Language Processing (NLP). Recently, Wang et al [177] have used self-supervised learning to achieve accurate 3D human posture estimation.…”
Section: B: Not Just Full-supervised Learningmentioning
confidence: 99%
“…Learning‐based discriminative methods are quite popular for the real‐time 2D pose estimation of single or multiple characters . More recently, a number of methods have tackled a much harder problem, that of 3D skeletal estimation from single color images or videos . A common limitation of the discriminative methods in 3D pose reconstruction, though, is that they typically run off‐line, and because they do not take into consideration the temporal consistency of motion (they reconstruct the 3D joint positions on per image), the generated motion is oscillating.…”
Section: Related Workmentioning
confidence: 99%
“…The big challenge in this context is to learn rich features to encode depth, spatial and temporal relation of the body parts so as to ensure smooth motion reconstruction . While some recent methods run at high frame (e.g., other works), they are still unsuitable for use in closely interactive characters or crowds. This is because they require a tracked bounding box for each person, and thus, they can only reconstruct the motion of a single person at a time.…”
Section: Introductionmentioning
confidence: 99%