2021
DOI: 10.48550/arxiv.2106.05744
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Pivotal Tuning for Latent-based Editing of Real Images

Abstract: Recently, a surge of advanced facial editing techniques have been proposed that leverage the generative power of a pre-trained StyleGAN. To successfully edit an image this way, one must first project (or invert) the image into the pre-trained generator's domain. As it turns out, however, StyleGAN's latent space induces an inherent tradeoff between distortion and editability, i.e. between maintaining the original appearance and convincingly altering some of its attributes. Practically, this means it is still ch… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
134
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 40 publications
(134 citation statements)
references
References 35 publications
0
134
0
Order By: Relevance
“…In addition, there are other two-stage approaches but are not hybrid ones. For example, PIE [44] uses optimization methods on both phases, while PTI [39] combines optimization process with the generator fine-tuning technique. In comparison, our work is also a two-stage method but differs from the above ones in that our approach is only based purely on the encoder-based manner in both phases.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…In addition, there are other two-stage approaches but are not hybrid ones. For example, PIE [44] uses optimization methods on both phases, while PTI [39] combines optimization process with the generator fine-tuning technique. In comparison, our work is also a two-stage method but differs from the above ones in that our approach is only based purely on the encoder-based manner in both phases.…”
Section: Related Workmentioning
confidence: 99%
“…Our goal in this phase is to recover the missing information of input x and thus reduce the discrepancy between x and xw . Other two-stage methods use either per-image optimization process [44] or per-image fine-tuning G [39] to improve reconstruction quality further. However, the drawback of such these approaches is the slow inference time.…”
Section: Phase Ii: Generator Refinement Via Hypernet-mentioning
confidence: 99%
See 1 more Smart Citation
“…Although traversing the latent space of unconditional GANs can achieve image editing in closed-domain images such as faces, its incapability of generating real-world images (e.g., multiple objects and complex scenes) limits their generalization and application. In addition, since their hidden spaces need to retain all the information of the generated outputs, the inversion [54] of an open-domain image is usually compromised for photo fidelity [1,34]. In contrast, the editing space of our proposed model does not have such limitations.…”
Section: Related Workmentioning
confidence: 99%
“…One related work is styleGAN [19], which is trained to generate realistic images for closed-domain categories such as faces, cats, and cars. Since then, a series of manipulation works [6,34,35,41,44,45] have been built upon styleGAN by inverting a given image to its latent space and then manipulating the latent code to generate a new image while keeping the generator intact.…”
Section: Introductionmentioning
confidence: 99%