2020
DOI: 10.1609/aaai.v34i07.6970
|View full text |Cite
|
Sign up to set email alerts
|

Realistic Face Reenactment via Self-Supervised Disentangling of Identity and Pose

Abstract: Recent works have shown how realistic talking face images can be obtained under the supervision of geometry guidance, e.g., facial landmark or boundary. To alleviate the demand for manual annotations, in this paper, we propose a novel self-supervised hybrid model (DAE-GAN) that learns how to reenact face naturally given large amounts of unlabeled videos. Our approach combines two deforming autoencoders with the latest advances in the conditional generation. On the one hand, we adopt the deforming autoencoder t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 40 publications
(14 citation statements)
references
References 19 publications
0
13
0
Order By: Relevance
“…The model, named ICface, is based on two generators: one for generating a neutral image from the target person and one for manipulating it as per the extracted non-identity attributes of the input face. As another example of these models, DAE-GAN [200] learns separate pose and identity representations for generating talking faces. A conditional GAN is then employed to perform the style transformation.…”
Section: B Video Generationmentioning
confidence: 99%
“…The model, named ICface, is based on two generators: one for generating a neutral image from the target person and one for manipulating it as per the extracted non-identity attributes of the input face. As another example of these models, DAE-GAN [200] learns separate pose and identity representations for generating talking faces. A conditional GAN is then employed to perform the style transformation.…”
Section: B Video Generationmentioning
confidence: 99%
“…Feature fusion is an important process in face swapping. Most previous works [12,28,29,47,50,53] are inspired by style transfer methods. They employ AdaIN [21] to inject the identity vector into the target face to generate swapped face.…”
Section: Feature Fusionmentioning
confidence: 99%
“…With the recent progress in computer vision and speech processing and the advent of generative adversarial networks, manipulated media has leaped a significant step forward. These advances have unleashed potential for harmful applications like face puppeteering [83,136], speech manipulation [97], face transfer [115], full body manipulation [23], and other more general media manipulations. Many of these manipulations can be used to impersonate others, spread false information, or just introduce bias in the observer.…”
Section: Computer Visionmentioning
confidence: 99%