2019
DOI: 10.48550/arxiv.1903.12003
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

High Fidelity Face Manipulation with Extreme Poses and Expressions

Abstract: Face manipulation has shown remarkable advances with the flourish of Generative Adversarial Networks. However, due to the difficulties of controlling the structure and texture in high-resolution, it is challenging to simultaneously model pose and expression during manipulation. In this paper, we propose a novel framework that simplifies face manipulation with extreme pose and expression into two correlated stages: a boundary prediction stage and a disentangled face synthesis stage. In the first stage, we propo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 70 publications
0
7
0
Order By: Relevance
“…Recently, the use of VAEs [11] and adversarial image-to-image translation networks have become extremely popular for editing facial expressions [44,106,24], with or without paired data for training. Some of these models use attention masks [73], facial shape information [33,31] or exemplar videos [83] to guide the model in this task.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, the use of VAEs [11] and adversarial image-to-image translation networks have become extremely popular for editing facial expressions [44,106,24], with or without paired data for training. Some of these models use attention masks [73], facial shape information [33,31] or exemplar videos [83] to guide the model in this task.…”
Section: Related Workmentioning
confidence: 99%
“…Thus, several methods [26,36,39,41] use generators based on direct synthesis, where the image is generated using a sequence of convolutional operators, interleaved with elementwise non-linearities, and normalizations. Person identity information may be injected into such architecture, either with a lengthy learning process (in the many-shot scenario) [26,39] or by using adaptive normalizations conditioned on person embeddings [13,36,41]. The method [41] effectively combines both approaches by injecting identity through adaptive normalizations, and then fine-tuning the resulting generator on the fewshot learning set.…”
Section: Related Workmentioning
confidence: 99%
“…The first works required a video [26,39] or even multiple videos [28,35] to create a neural network that can synthesize talking head view of a person. Most recently, several works [13,17,33,33,36,37,41] presented systems that Existing few-shot neural head avatar systems achieve remarkable results. Yet, unlike some of the graphics-based avatars, the neural systems are too slow to be deployed on mobile devices and require a high-end desktop GPU to run in real-time.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, the great progress of high-quality face synthesis [38,5,33,39] has made "recognition via generation" possible. TP-GAN [16] and CAPG-GAN [13] introduce face synthesis to improve the quantitative performance of large pose face recognition.…”
Section: Introductionmentioning
confidence: 99%