2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022
DOI: 10.1109/wacv51458.2022.00350
|View full text |Cite
|
Sign up to set email alerts
|

StyleMC: Multi-Channel Based Fast Text-Guided Image Generation and Manipulation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
31
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 47 publications
(32 citation statements)
references
References 13 publications
1
31
0
Order By: Relevance
“…However, techniques using the StyleSpace [61], e.g. StyleMC [31], tend to yield better results in our experiments confirming the findings of concurrent work by Alaluf et al [4]. Furthermore, we remark that for applications where equivariance is not essential, our framework can easily be used with StyleGAN2 layers instead.…”
Section: Limitations and Future Worksupporting
confidence: 90%
See 1 more Smart Citation
“…However, techniques using the StyleSpace [61], e.g. StyleMC [31], tend to yield better results in our experiments confirming the findings of concurrent work by Alaluf et al [4]. Furthermore, we remark that for applications where equivariance is not essential, our framework can easily be used with StyleGAN2 layers instead.…”
Section: Limitations and Future Worksupporting
confidence: 90%
“…6 (Left), we first invert a given source image via latent space optimization. We then apply different manipulation directions obtained by GANspace [20] and StyleMC [31]. Prior work [23] also investigates in-plane translation.…”
Section: Inversion and Manipulationmentioning
confidence: 99%
“…A more recent work, LatentCLR [60], proposes a contrastive learning approach to find unsupervised directions that are transferable to different classes. Moreover, StyleCLIP [40] and StyleMC [34] both propose using CLIP for text-guided manipulation of both randomly generated and encoded images with StyleGAN2. These methods show that it is possible to use CLIP for fine-grained and disentangled manipulations of images.…”
Section: Latent Space Manipulationmentioning
confidence: 99%
“…Similarly, [1] uses conditional continuous normalization flows to perform supervised attribute processing in the latent space of StyleGAN2. Recently, text-based manipulation methods have been proposed [15,21], which use CLIP [24] to perform fine-grained and disentangled manipulations of images.…”
Section: Latent Space Manipulationmentioning
confidence: 99%
“…However, most previous work on the manipulation of latent space has focused on the discovery of a single direction by which the desired editing can be achieved, such as zoom-in [6,11] or altering facial features, such as applying a lipstick [15,34]. In this work, we focus on the discovery of multiple and diverse directions that can achieve the desired edit.…”
Section: Introductionmentioning
confidence: 99%