2019
DOI: 10.1109/tpami.2018.2816031
|View full text |Cite
|
Sign up to set email alerts
|

MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior

Abstract: Recovering 3D full-body human pose is a challenging problem with many applications. It has been successfully addressed by motion capture systems with body worn markers and multiple cameras. In this paper, we address the more challenging case of not only using a single camera but also not leveraging markers: going directly from 2D appearance to 3D geometry. Deep learning approaches have shown remarkable abilities to discriminatively learn 2D appearance features. The missing piece is how to integrate 2D, 3D and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
104
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 185 publications
(105 citation statements)
references
References 69 publications
(154 reference statements)
1
104
0
Order By: Relevance
“…Our network predicts the normalized locations of 3D joints. During testing, to calibrate the scale of the outputs, we require that the sum of length of all 3D bones is equal to that of a canonical skeleton as shown in [41,75,78]. Therefore, we follow the method in [75] for calibration.…”
Section: Datasets and Evaluation Protocolsmentioning
confidence: 99%
“…Our network predicts the normalized locations of 3D joints. During testing, to calibrate the scale of the outputs, we require that the sum of length of all 3D bones is equal to that of a canonical skeleton as shown in [41,75,78]. Therefore, we follow the method in [75] for calibration.…”
Section: Datasets and Evaluation Protocolsmentioning
confidence: 99%
“…There is rich recent literature on 3D pose estimation in the form of a simplistic body skeleton, e.g., [3,19,22,24,25,29,30,34,35,38,40,41,42,50,51]. However, in this Section, we focus on the more relevant works recovering the full shape and pose of the human body.…”
Section: Related Workmentioning
confidence: 99%
“…Weakly Supervised: Approaches such as [3,10,44,53,54,55] do not explicitly use paired 2D-3D correspondences, but use unpaired 3D data to learn priors on shape (3D basis) or pose (articulation priors). For example, Zhou et al [54] use a 3D pose dictionary to learn pose priors and Brau et al [3] employ an independently trained network that learns a prior distribution over 3D poses (kinematic and self-intersection priors).…”
Section: Related Workmentioning
confidence: 99%