Guofeng Cui scite author profile

Talking-head Generation with Rhythmic Head Motion

When people deliver a speech, they naturally move their heads, and this rhythmic head motion conveys prosodic information. However, generating a lip-synced video while moving the head naturally is challenging. While remarkably successful, existing works either generate still talking-face videos or rely on landmark/video frames as sparse/dense mapping guidance to generate head movements, which leads to unrealistic or uncontrollable video synthesis. To overcome these limitations, we propose a 3D-aware generative network along with a hybrid embedding module and a non-linear composition module. By modeling the head motion and facial expressions explicitly, manipulating 3D animation carefully, and embedding reference images dynamically, our approach achieves controllable, photo-realistic, and temporally coherent talking-head videos with natural head movements. Thoughtful experiments on several standard benchmarks demonstrate that our method achieves significantly better results than the state-of-the-art methods in both quantitative and qualitative comparisons. The code is available at https://github.com/lelechen63/Talking-head-Generation-with-Rhythmic-Head-Motion.

Improve CAM with Auto-adapted Segmentation and Co-supervised Augmentation

Kou, Cui, Wang, et al. 2021

Improve CAM with Auto-adapted Segmentation and Co-supervised Augmentation

Kou, Cui, Wang, et al. 2019

Preprint

Construction of Interactive Online Teaching System for English Speech Based on Web

Cheng, Cui 2023
