FML: Face Model Learning From Videos

Tewari, Ayush; Bernard, Florian; Garrido, Pablo; Bharaj, Gaurav; Elgharib, Mohamed; Seidel, Hans‐Peter; Pérez, Patrick; Zollhöfer, Michael; Theobalt, Christian

doi:10.1109/cvpr.2019.01107

Cited by 172 publications

(163 citation statements)

References 71 publications

Supporting

Mentioning

162

Contrasting

Order By: Relevance

“…. Several other works have shown that combining a prior template about the object category shape with video allows for an improved 3D reconstruction of the underlying geometry, both for faces [67,63,43] and quadrupeds [8]. However, these methods still require multiple videos and a template, while our method does not.…”

Section: Previous Workmentioning

confidence: 92%

See 1 more Smart Citation

Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model Using Deep Non-Rigid Structure From Motion

Sahasrabudhe

Shu

Bartrum

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)

View full text Add to dashboard Cite

Figure 1: We introduce Lifting AutoEncoders, a deep generative model of 3D shape variability that is learned from an unstructured photo collection without supervision. Having access to 3D allows us to disentangle the effects of viewpoint, non-rigid shape (due to identity/expression), illumination and albedo and perform entirely controllable image synthesis. AbstractIn this work we introduce Lifting Autoencoders, a generative 3D surface-based model of object categories. We bring together ideas from non-rigid structure from motion, image formation, and morphable models to learn a controllable, geometric model of 3D categories in an entirely unsupervised manner from an unstructured set of images. We exploit the 3D geometric nature of our model and use normal information to disentangle appearance into illumination, shading and albedo. We further use weak supervision to disentangle the non-rigid shape variability of human faces into identity and expression. We combine the 3D representation with a differentiable renderer to generate RGB images and append an adversarially trained refinement network to obtain sharp, photorealistic image reconstruction * Indicating equal contributions.results. The learned generative model can be controlled in terms of interpretable geometry and appearance factors, allowing us to perform photorealistic image manipulation of identity, expression, 3D pose, and illumination properties.

show abstract

Section: Previous Workmentioning

confidence: 92%

“…Effectively all works addressing aspects related to 3D geometry rely on paired data for training, e.g. multiple views of the same object [71], videos [48] or some pre-existing 3D mesh representation that is the starting point for further disentanglement [21,56,81,62] or self-supervision [85].…”

Section: Previous Workmentioning

confidence: 99%

Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model Using Deep Non-Rigid Structure From Motion

Sahasrabudhe

Shu

Bartrum

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)

View full text Add to dashboard Cite

show abstract

“…Recently, several deep learning-based models were published that fall into this group of nonlinear models [Bagautdinov et al 2018;Lombardi et al 2018;Tewari et al 2019Tran and Liu 2018a]. Section 6 covers these models in more detail.…”

Section: Multiplicative Modelsmentioning

confidence: 99%

“…The rise of deep learning methods facilitated to learn per-vertex appearance models directly from images, such as done by , who learn per-vertex albedo model offsets in order to improve the generalization ability of an existing PCA-based model. Similarly, Tewari et al [2019], learn a per-vertex albedo model from scratch based on video data. train a mesh decoder that jointly models the texture and shape on a per-vertex basis, which, however, relies on the availability of 3D shape and appearance data.…”

Section: Nonlinear Modelsmentioning

confidence: 99%

3D Morphable Face Models—Past, Present, and Future

et al. 2020

Self Cite

View full text Add to dashboard Cite

In this article, we provide a detailed survey of 3D Morphable Face Models over the 20 years since they were first proposed. The challenges in building and applying these models, namely, capture, modeling, image formation, and image analysis, are still active research topics, and we review the state-of-the-art in each of these areas. We also look ahead, identifying unsolved challenges, proposing directions for future research, and highlighting the broad range of current and future applications.

show abstract

“…Hence, in this work, we learn from complete texture maps obtained from 3D registrations. 3D person reconstruction from images While promising, recent methods for 3D person reconstruction either require video as input [6,7,8], scans [74], do not allow control over pose, shape and clothing [48,56], focus only on faces [72,32,63,57,47,62], or only on garments [68].…”

Section: Related Workmentioning

confidence: 99%

360-Degree Textures of People in Clothing from a Single Image

Lazova

Insafutdinov

Pons-Moll

2019

2019 International Conference on 3D Vision (3DV)

148

109

View full text Add to dashboard Cite

DensePose Garment segmentation Partial texture Completed texture Partial segmentation Completed segmentation Displacement maps Input view Fully-textured 3D avatar Figure 1: Given a single view of a person we predict a complete texture map in the UV space, complete clothing segmentation as well as a displacement map for the SMPL model [41], which we then combine to obtain a fully-textured 3D avatar. AbstractIn this paper we predict a full 3D avatar of a person from a single image. We infer texture and geometry in the UVspace of the SMPL model using an image-to-image translation method. Given partial texture and segmentation layout maps derived from the input view, our model predicts the complete segmentation map, the complete texture map, and a displacement map. The predicted maps can be applied to the SMPL model in order to naturally generalize to novel poses, shapes, and even new clothing. In order to learn our model in a common UV-space, we non-rigidly register the SMPL model to thousands of 3D scans, effectively encoding textures and geometries as images in correspondence. This turns a difficult 3D inference task into a simpler image-toimage translation one. Results on rendered scans of people and images from the DeepFashion dataset demonstrate that our method can reconstruct plausible 3D avatars from a single image. We further use our model to digitally change pose, shape, swap garments between people and edit clothing. To encourage research in this direction we will make the source code available for research purpose [5].

show abstract

FML: Face Model Learning From Videos

Cited by 172 publications

References 71 publications

Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model Using Deep Non-Rigid Structure From Motion

Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model Using Deep Non-Rigid Structure From Motion

3D Morphable Face Models—Past, Present, and Future

360-Degree Textures of People in Clothing from a Single Image

Contact Info

Product

Resources

About