Yonggen Ling scite author profile

We address the problem of recovering the 3D geometry of a human face from a set of facial images in multiple views. While recent studies have shown impressive progress in 3D Morphable Model (3DMM) based facial reconstruction, the settings are mostly restricted to a single view. There is an inherent drawback in the single-view setting: the lack of reliable 3D constraints can cause unresolvable ambiguities. In this paper, we explore 3DMM-based shape recovery in a different setting, where a set of multi-view facial images is given as input. A novel approach is proposed to regress 3DMM parameters from multi-view inputs with an end-to-end trainable Convolutional Neural Network (CNN). Multi-view geometric constraints are incorporated into the network by establishing dense correspondences between different views, leveraging a novel self-supervised view alignment loss. The main ingredient of the view alignment loss is a differentiable dense optical flow estimator that can backpropagate the alignment errors between an input view and a synthetic rendering from another input view, which is projected to the target view through the 3D shape to be inferred. By minimizing the view alignment loss, better 3D shapes can be recovered, such that the synthetic projections from one view to another better align with the observed image. Extensive experiments demonstrate the superiority of the proposed method over other 3DMM methods.


A Multi-Scale Guided Cascade Hourglass Network for Depth Completion

Yuan, Ling, et al. 2020

113


Left-Right Comparative Recurrent Model for Stereo Matching

Jie, Wang, Ling, et al. 2018


Leveraging the disparity information from both left and right views is crucial for stereo disparity estimation. The left-right consistency check is an effective way to enhance the disparity estimation by referring to the information from the opposite view. However, the conventional left-right consistency check is an isolated post-processing step and heavily hand-crafted. This paper proposes a novel left-right comparative recurrent model to perform left-right consistency checking jointly with disparity estimation. At each recurrent step, the model produces disparity results for both views, and then performs online left-right comparison to identify the mismatched regions that are likely to contain erroneously labeled pixels. A soft attention mechanism is introduced, which employs the learned error maps to guide the model to selectively focus on refining the unreliable regions at the next recurrent step. In this way, the generated disparity maps are progressively improved by the proposed recurrent model. Extensive evaluations on the KITTI 2015, Scene Flow, and Middlebury benchmarks validate the effectiveness of our model, demonstrating that state-of-the-art stereo disparity estimation results can be achieved by this new model.
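For orientation, the conventional post-processing check that the abstract contrasts against can be sketched roughly as below. This is an illustrative sketch only, not the paper's recurrent model: the function name `left_right_consistency_mask`, the threshold value, and the nearest-pixel rounding are all assumptions made for the example.

```python
import numpy as np

def left_right_consistency_mask(disp_left, disp_right, threshold=1.0):
    """Return a boolean mask that is True where the left disparity agrees
    with the right disparity at the matched pixel (hypothetical helper)."""
    height, width = disp_left.shape
    xs = np.tile(np.arange(width), (height, 1))
    ys = np.tile(np.arange(height)[:, None], (1, width))
    # A left pixel at column x is matched to the right pixel at x - d_L(x, y).
    x_right = np.rint(xs - disp_left).astype(int)
    in_bounds = (x_right >= 0) & (x_right < width)
    x_clipped = np.clip(x_right, 0, width - 1)
    # Look up the right-view disparity at the matched location.
    disp_right_sampled = disp_right[ys, x_clipped]
    consistent = np.abs(disp_left - disp_right_sampled) <= threshold
    # Pixels whose match falls outside the image are treated as inconsistent.
    return consistent & in_bounds
```

Pixels where the mask is False are the "mismatched regions" a hand-crafted pipeline would typically discard or fill in afterwards; the abstract describes folding this comparison into the model itself instead.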


Self-Supervised Learning of Detailed 3D Face Reconstruction

Chen, Wang, et al. 2020

IEEE Trans. on Image Process.


Domain Adaptation Gaze Estimation by Embedding with Prediction Consistency

Guo, Yuan, Zhang, et al. 2021




MVF-Net: Multi-View 3D Face Morphable Model Regression
