Learning to Dress 3D People in Generative Clothing

Ma, Qianli; Yang, Jinlong; Ranjan, Anurag; Pujades, Sergi; Pons-Moll, Gerard; Tang, Siyu; Black, Michael J.

doi:10.1109/cvpr42600.2020.00650

Cited by 306 publications

(234 citation statements)

References 46 publications

Supporting

Mentioning

233

Contrasting

Unclassified

Order By: Relevance

“…Binary user study. To perform a direct comparison, we follow [21,25], and show results of two methods to the turkers at the same time. The turkers are asked to pick the one that they think is more perceptually natural.…”

Section: Perceptual Naturalnessmentioning

confidence: 99%

PLACE: Proximity Learning of Articulation and Contact in 3D Environments

Zhang

et al. 2020

2020 International Conference on 3D Vision (3DV)

Self Cite

View full text Add to dashboard Cite

show abstract

Section: Perceptual Naturalnessmentioning

confidence: 99%

PLACE: Proximity Learning of Articulation and Contact in 3D Environments

Zhang

et al. 2020

2020 International Conference on 3D Vision (3DV)

Self Cite

View full text Add to dashboard Cite

show abstract

“…All the above methods do not provide topology-consistent dynamic human meshes with detailed surface geometry, which are very useful for training and evaluating graph-based deep neural networks. The CAPE [48] dataset is a dynamic 3D clothed human model dataset with consistent SMPL mesh topology (6890 vertices), but the geometry details are limited due to using a small number of vertices. Moreover, the color images are re-rendered, not real captured.…”

Section: B 3d Human Datasetsmentioning

confidence: 99%

Image-Guided Human Reconstruction via Multi-Scale Graph Transformation Networks

Wen

Feng

et al. 2021

IEEE Trans. on Image Process.

View full text Add to dashboard Cite

3D human reconstruction from a single image is a challenging problem. Existing methods have difficulties to infer 3D clothed human models with consistent topologies for various poses. In this paper, we propose an efficient and effective method using a hierarchical graph transformation network. To deal with large deformations and avoid distorted geometries, rather than using Euclidean coordinates directly, 3D human shapes are represented by a vertex-based deformation representation that effectively encodes the deformation and copes well with large deformations. To infer a 3D human mesh consistent with the input real image, we also use a perspective projection layer to incorporate perceptual image features into the deformation representation. Our model is easy to train and fast to converge with short test time. Besides, we present the D 2 Human (Dynamic Detailed Human) dataset, including variously posed 3D human meshes with consistent topologies and rich geometry details, together with the captured color images and SMPL models, which is useful for training and evaluation of deep frameworks, particularly for graph neural networks. Experimental results demonstrate that our method achieves more plausible and complete 3D human reconstruction from a single image, compared with several state-of-the-art methods.

show abstract

“…In 3D vision, HMR [34] applies adversarial learning to estimate 3D human body shape and pose from 2D images. CAPE [35] uses it to learn a model of people in clothing. Fernández Abrevaya et al [36] and Shamai et al [37] use adversarial training to model faces in 3D.…”

Section: Related Workmentioning

confidence: 99%

Reconstructing Human Body Mesh from Point Clouds by Adversarial GP Network

Zhou

Franco

Bogo

et al. 2021

Computer Vision – ACCV 2020

View full text Add to dashboard Cite

We study the problem of reconstructing the template-aligned mesh for human body estimation from unstructured point cloud data. Recently proposed approaches for shape matching that rely on Deep Neural Networks (DNNs) achieve state-of-the-art results with generic pointwise architectures; but in doing so, they exploit much weaker human body shape and surface priors with respect to methods that explicitly model the body surface with 3D templates. We investigate the impact of adding back such stronger shape priors by proposing a novel dedicated human template matching process, which relies on a point-based, deep autoencoder architecture. We encode surface smoothness and shape coherence with a specialized Gaussian Process layer. Furthermore, we enforce global consistency and improve the generalization capabilities of the model by introducing an adversarial training phase. The choice of these elements is grounded on an extensive analysis of DNNs failure modes in widely used datasets like SURREAL and FAUST. We validate and evaluate the impact of our novel components on these datasets, showing a quantitative improvement over state-of-the-art DNN-based methods, and qualitatively better results.

show abstract

Learning to Dress 3D People in Generative Clothing

Cited by 306 publications

References 46 publications

PLACE: Proximity Learning of Articulation and Contact in 3D Environments

PLACE: Proximity Learning of Articulation and Contact in 3D Environments

Image-Guided Human Reconstruction via Multi-Scale Graph Transformation Networks

Reconstructing Human Body Mesh from Point Clouds by Adversarial GP Network

Contact Info

Product

Resources

About