2022
DOI: 10.1145/3527168
|View full text |Cite
|
Sign up to set email alerts
|

StyleFusion: Disentangling Spatial Segments in StyleGAN-Generated Images

Abstract: We present StyleFusion , a new mapping architecture for StyleGAN, which takes as input a number of latent codes and fuses them into a single style code. Inserting the resulting style code into a pre-trained StyleGAN generator results in a single harmonized image in which each semantic region is controlled by one of the input latent codes. Effectively, StyleFusion yields a disentangled representation of the image, providing fine-grained control over each region of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 31 publications
0
8
0
Order By: Relevance
“…For ameliorating edits in unexpected regions of an image, strategies for blending latent features have been an emerging theme in many recent papers [7,16,17,21]. [7,16,21] interpolate spatial features more explicitly.…”
Section: Feat (2022)mentioning
confidence: 99%
See 1 more Smart Citation
“…For ameliorating edits in unexpected regions of an image, strategies for blending latent features have been an emerging theme in many recent papers [7,16,17,21]. [7,16,21] interpolate spatial features more explicitly.…”
Section: Feat (2022)mentioning
confidence: 99%
“…[7,16,21] interpolate spatial features more explicitly. In contrast, StyleFusion [17] realizes similar objectives through blended latent code extracted using a fusion network that combines disjoint semantic attributes from multiple images into a single photorealistic image.…”
Section: Feat (2022)mentioning
confidence: 99%
“…Similarly, Abdal et al [AQW20] change the spatial activations to allow scribble‐level control for the user (see Section 4 for more details). StyleFusion [KPACO21] propose a new mapping architecture for StyleGAN to better disentangle a target attribute. This results in a learned blending between style codes, resulting in fine‐grained local control of the edited images.…”
Section: Latent Space Editingmentioning
confidence: 99%
“…Alternatively, Tritong et al [TRS21] directly use StyleGAN and f for segmentation by first inverting a real image into latent space. In the context of local editing, Collins et al [CBPS20] and Kafri et al [KPACO21] perform a simple clustering procedure over StyleGAN's internal representations to obtain a semantic segmentation of an input image. This semantic map can then be used to perform local editing over an image, guided by a target reference image.…”
Section: Discriminative Applicationsmentioning
confidence: 99%
“…The latent component responsible for synthesizing the target image region is then located via an optimization algorithm, and FAM can be achieved by modifying it according to the corresponding code of an exemplar image. StyleFusion [163] proposes a hierarchical fusion strategy to disentangle latent representations according to the decomposition patterns obtained by EIS. In addition, RIS [164] extends EIS to manipulate more challenging facial attributes which involve large geometric changes (e.g., hairstyle and pose).…”
Section: D Graphics Modelsmentioning
confidence: 99%