2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01123
|View full text |Cite
|
Sign up to set email alerts
|

Expressive Body Capture: 3D Hands, Face, and Body From a Single Image

Abstract: To facilitate the analysis of human actions, interactions and emotions, we compute a 3D model of human body pose, hand pose, and facial expression from a single monocular image. To achieve this, we use thousands of 3D scans to train a new, unified, 3D model of the human body, SMPL-X, that extends SMPL with fully articulated hands and an expressive face. Learning to regress the parameters of SMPL-X directly from images is challenging without paired images and 3D ground truth. Consequently, we follow the approac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
1,070
0
5

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 1,283 publications
(1,076 citation statements)
references
References 65 publications
1
1,070
0
5
Order By: Relevance
“…3D point clouds or meshes ( Figure 1c), have become a popular and elegant way to capture the soft-tissue and shape of bodies, which are highly important features for person identification, fashion (i.e. clothing sales), and in medicine (21,42,43). However, state-of-the-art performance currently requires body-scanning of many subjects to make body models.…”
Section: Dense-representations Of Bodiesmentioning
confidence: 99%
“…3D point clouds or meshes ( Figure 1c), have become a popular and elegant way to capture the soft-tissue and shape of bodies, which are highly important features for person identification, fashion (i.e. clothing sales), and in medicine (21,42,43). However, state-of-the-art performance currently requires body-scanning of many subjects to make body models.…”
Section: Dense-representations Of Bodiesmentioning
confidence: 99%
“…Recent methods based on deep learning, extend 3D human pose estimation to complex scenes [32,42,48,50] but the 3D accuracy is limited. To estimate human-scene interaction, however, more realistic body models are needed that include fully articulated hands such as in [31,49].…”
Section: Related Workmentioning
confidence: 99%
“…Thus, we penalize poses in which the body interpenetrates scene objects. We formulate this "exclusion principle" as a differentiable loss function that we incorporate into the SMPLify-X pose estimation method [49].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations