Pedestrian detection is a critical problem in computer vision with significant impact on safety in urban autonomous driving. In this work, we explore how semantic segmentation can be used to boost pedestrian detection accuracy while having little to no impact on network efficiency. We propose a segmentation infusion network to enable joint supervision on semantic segmentation and pedestrian detection. When placed properly, the additional supervision helps guide features in shared layers to become more sophisticated and helpful for the downstream pedestrian detector. Using this approach, we find weakly annotated boxes to be sufficient for considerable performance gains. We provide an in-depth analysis to demonstrate how shared layers are shaped by the segmentation supervision. In doing so, we show that the resulting feature maps become more semantically meaningful and robust to shape and occlusion. Overall, our simultaneous detection and segmentation framework achieves a considerable gain over the state-of-the-art on the Caltech pedestrian dataset, competitive performance on KITTI, and executes 2× faster than competitive methods.
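A minimal sketch of the joint-supervision idea described above, assuming a PyTorch-style implementation: a shared backbone feeds both a detection head and an auxiliary segmentation head, and the segmentation loss is computed against weak, box-derived masks. The module names, layer choices, and loss weighting below are illustrative assumptions, not the paper's exact architecture.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SegmentationInfusionNet(nn.Module):
    """Illustrative shared backbone with detection and auxiliary segmentation heads."""
    def __init__(self, num_classes=2):
        super().__init__()
        # Shared convolutional layers whose features serve both tasks.
        self.shared = nn.Sequential(
            nn.Conv2d(3, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        # Auxiliary segmentation head: 1x1 conv producing per-pixel class logits.
        self.seg_head = nn.Conv2d(128, num_classes, 1)
        # Detection head placeholder (e.g., a dense per-location classifier).
        self.det_head = nn.Conv2d(128, num_classes, 1)

    def forward(self, x):
        feats = self.shared(x)
        return self.det_head(feats), self.seg_head(feats)

def joint_loss(det_logits, seg_logits, det_targets, weak_masks, lam=1.0):
    # weak_masks are box-shaped foreground regions rasterized from the annotated
    # pedestrian boxes (weak annotation, per the abstract); det_targets is a
    # per-location class map for the dense detection head.
    det_loss = F.cross_entropy(det_logits, det_targets)
    seg_loss = F.cross_entropy(seg_logits, weak_masks)
    return det_loss + lam * seg_loss

The design point this sketch is meant to convey: the segmentation head only supervises the shared layers during training and can be discarded at inference, which is consistent with the abstract's claim of little to no impact on runtime efficiency.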
Despite recent advances in face recognition using deep learning, severe accuracy drops are observed for large pose variations in unconstrained environments. Learning pose-invariant features is one solution, but it requires expensively labeled large-scale data and carefully designed feature learning algorithms. In this work, we focus on frontalizing faces in the wild under various head poses, including extreme profile views. We propose a novel deep 3D Morphable Model (3DMM) conditioned Face Frontalization Generative Adversarial Network (GAN), termed FF-GAN, to generate neutral-head-pose face images. Our framework differs from both traditional GANs and 3DMM-based modeling. Incorporating the 3DMM into the GAN structure provides shape and appearance priors for fast convergence with less training data, while also supporting end-to-end training. The 3DMM-conditioned GAN employs not only the discriminator and generator losses but also a new masked symmetry loss to retain visual quality under occlusions, as well as an identity loss to recover high-frequency information. Experiments on face recognition, landmark localization, and 3D reconstruction consistently show the advantage of our frontalization method on in-the-wild face datasets. This work was supported by a research gift from NEC Labs to Michigan State University. Detailed results and resources are available at: http://cvlab.cse.msu.edu/project-face-frontalization.html.
[Figure: FF-GAN overview — a pose-variant (extreme pose) input and its 3DMM coefficients are fed to the generator, whose frontalized output is judged by a discriminator and passed to a recognition engine.]
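A hedged sketch of how the generator's composite objective could be assembled, assuming PyTorch-style tensors: an adversarial term, the masked symmetry term (penalizing left-right asymmetry only within a visibility mask), and an identity term over features from a face recognition network. The function names, the exact symmetry formulation, and the loss weights are illustrative assumptions; any training terms beyond those named in the abstract are omitted.

import torch
import torch.nn.functional as F

def masked_symmetry_loss(frontal, mask):
    # Penalize differences between the frontalized face and its horizontal
    # flip, restricted to a visibility mask so occluded regions are ignored.
    flipped = torch.flip(frontal, dims=[3])  # flip width dim of NCHW tensors
    return (mask * (frontal - flipped).abs()).mean()

def ffgan_generator_loss(frontal, disc_fake_score, mask,
                         id_feat_fake, id_feat_real,
                         w_adv=1.0, w_sym=0.1, w_id=1.0):
    # Adversarial term: push the discriminator to score generated faces as real.
    adv = F.binary_cross_entropy_with_logits(
        disc_fake_score, torch.ones_like(disc_fake_score))
    # Masked symmetry term for visual quality under occlusion.
    sym = masked_symmetry_loss(frontal, mask)
    # Identity term: match deep identity features of the frontalized output and
    # the input subject to recover identity-preserving high-frequency detail.
    ident = F.l1_loss(id_feat_fake, id_feat_real)
    return w_adv * adv + w_sym * sym + w_id * ident

Here id_feat_fake and id_feat_real are assumed to come from a fixed, pretrained face recognition network applied to the generated and input images, respectively; the relative weights are placeholders rather than the paper's tuned values.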