Explaining in Style: Training a GAN to explain a classifier in StyleSpace

Lang, Oran; Gandelsman, Yossi; Yarom, Michal; Wald, Yoav; Elidan, Gal; Hassidim, Avinatan; Freeman, William T.; Isola, Phillip; Globerson, Amir; Irani, Michal; Mosseri, Inbar

doi:10.48550/arxiv.2104.13369

Cited by 14 publications

(15 citation statements)

References 34 publications

“…Indeed, for some attributes, even finding a disentangled latent direction is infeasible. Furthermore, similar to other methods which rely on StyleGAN [7,29], our method obtains better results when operating on images within the domain used to train the GAN. This limitation stems in part from the inability of current GAN Inversion methods to reconstruct out-of-domain images while preserving latent semantics.…”

Section: Discussion (mentioning)

confidence: 52%

“…In the context of discriminative tasks, several recent methods have proposed to utilize GANs for additional purposes. Lang et al [29] used StyleGAN [27] to visualize counterfactual examples for explaining a pretrained classifier's predictions. Chai et al [7] used style-mixing in the fine-layers of StyleGAN to generate augmentations that are ensembled together at test-time.…”

Section: Related Work (mentioning)

confidence: 99%

LARGE: Latent-Based Regression through GAN Semantics

Nitzan, Gal, Ofir et al. 2021

Preprint

We propose a novel method for solving regression tasks using few-shot or weak supervision. At the core of our method is the fundamental observation that GANs are incredibly successful at encoding semantic information within their latent space, even in a completely unsupervised setting. For modern generative frameworks, this semantic encoding manifests as smooth, linear directions which affect image attributes in a disentangled manner. These directions have been widely used in GAN-based image editing. We show that such directions are not only linear, but that the magnitude of change induced on the respective attribute is approximately linear with respect to the distance traveled along them. By leveraging this observation, our method turns a pre-trained GAN into a regression model, using as few as two labeled samples. This enables solving regression tasks on datasets and attributes which are difficult to produce quality supervision for. Additionally, we show that the same latent-distances can be used to sort collections of images by the strength of given attributes, even in the absence of explicit supervision. Extensive experimental evaluations demonstrate that our method can be applied across a wide range of domains, leverage multiple latent direction discovery frameworks, and achieve state-of-the-art results in few-shot and low-supervision settings, even when compared to methods designed to tackle a single task.
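The core idea in this abstract (attribute strength is approximately linear in the distance traveled along a latent direction, so two labeled samples suffice to calibrate a regressor) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 512-dimensional toy latent codes, the random direction, and the labels 20 and 50 are all assumptions made for illustration, and the real method obtains latent codes by GAN inversion and directions from a direction-discovery framework.

```python
import numpy as np


def signed_distance(w, direction):
    """Signed distance of latent code w along a direction (normalised to unit length)."""
    unit = direction / np.linalg.norm(direction)
    return float(np.dot(w, unit))


def fit_two_point_regressor(d_a, y_a, d_b, y_b):
    """Fit y = slope * d + intercept through two labeled (distance, label) pairs."""
    if d_a == d_b:
        raise ValueError("the two labeled samples must differ along the direction")
    slope = (y_b - y_a) / (d_b - d_a)
    intercept = y_a - slope * d_a
    return slope, intercept


def predict(w, direction, slope, intercept):
    """Predict the attribute value of a latent code from its distance along the direction."""
    return slope * signed_distance(w, direction) + intercept


# Toy data (hypothetical): a random direction and two "labeled" latent codes.
rng = np.random.default_rng(0)
direction = rng.normal(size=512)
unit = direction / np.linalg.norm(direction)
w_a = rng.normal(size=512)   # labeled sample with attribute value 20
w_b = w_a + 3.0 * unit       # labeled sample with attribute value 50

slope, intercept = fit_two_point_regressor(
    signed_distance(w_a, direction), 20.0,
    signed_distance(w_b, direction), 50.0,
)

# An unlabeled code halfway between the two labeled ones.
w_mid = w_a + 1.5 * unit
predicted_mid = predict(w_mid, direction, slope, intercept)  # close to 35.0 by construction

# The same distances can rank codes by attribute strength without any labels.
codes = [w_a, w_b, w_mid]
ranked = sorted(range(len(codes)), key=lambda i: signed_distance(codes[i], direction))
```

In this toy setup the prediction is exact because the data were built to be linear; with real GAN latents the relationship is only approximately linear, as the abstract states.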

“…In the first step, we jointly train a generator G and an encoder E. We use the popular StyleGAN2 generator [16], and train it to produce realistic images using the original objective as have been employed by Karras et al [12,15,16], denoted L_GAN. The encoder is trained to reconstruct both real and synthesized images, similar to the work of Lang et al [19]. Let G_n(z) be an unconditionally generated image from normally distributed noise vector z, we denote its reconstruction as G(E(G_n(z))), where G(w) refers to applying the generator over latent code w (i.e.…”

Section: Generative-based Self-filtering (mentioning)

confidence: 99%

Self-Distilled StyleGAN: Towards Generation from Internet Photos

Mokady, Yarom, Tov et al. 2022

Preprint

Self Cite

StyleGAN is known to produce high-fidelity images, while also offering unprecedented semantic editing. However, these fascinating abilities have been demonstrated only on a limited set of datasets, which are usually structurally aligned and well curated. In this paper, we show how StyleGAN can be adapted to work on raw uncurated images collected from the Internet. Such image collections impose two main challenges to StyleGAN: they contain many outlier images, and are characterized by a multi-modal distribution. Training StyleGAN on such raw image collections results in degraded image synthesis quality. To meet these challenges, we propose a StyleGAN-based self-distillation approach, which consists of two main components: (i) a generative-based self-filtering of the dataset to eliminate outlier images, in order to generate an adequate training set, and (ii) perceptual clustering of the generated images to detect the inherent data modalities, which are then employed to improve StyleGAN's "truncation trick" in the image synthesis process. The presented technique enables the generation of high-quality images, while minimizing the loss in diversity of the data. Through qualitative and quantitative evaluation, we demonstrate the power of our approach on new challenging and diverse domains collected from the Internet. New datasets and pre-trained models are available on our project website.

“…GANalyze [8] takes advantage of the GAN-based model to visualize what a CNN model learns about high-level cognitive properties. StylEx [19] proposes to incorporate the classifier into the training process of StyleGAN and learn a classifier-specific StyleSpace. Sauer and Geiger [26] propose to disentangle object shape, object texture and background in the image generation process and generate structured counterfactuals which help improve the robustness and interpretability of classifiers.…”

Section: Generative Counterfactual Images (mentioning)

confidence: 99%

“…On one hand, it is difficult to sample valid counterfactuals from a high-dimensional image manifold by simply altering pixels. To address this difficulty, some recent studies exploit image generation techniques to produce counterfactuals [7,19,30]. On the other hand, even though image counterfactuals can be sampled using generators, it is sometimes still difficult to explain what attributes or concepts are altered.…”

Section: Introduction (mentioning)

confidence: 99%

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Li, Wang, Pei et al. 2022

Preprint

The semantically disentangled latent subspace in GANs provides rich interpretable controls in image generation. This paper includes two contributions on semantic latent subspace analysis in the scenario of face generation using StyleGAN2. First, we propose a novel approach to disentangle latent subspace semantics by exploiting existing face analysis models, e.g., face parsers and face landmark detectors. These models provide the flexibility to construct various criteria with very concrete and interpretable semantic meanings (e.g., change face shape or change skin color) to restrict latent subspace disentanglement. Rich, previously unknown latent space controls can be discovered using the constructed criteria. Second, we propose a new perspective to explain the behavior of a CNN classifier by generating counterfactuals in the interpretable latent subspaces we discovered. This explanation helps reveal whether the classifier learns semantics as intended. Experiments on various disentanglement criteria demonstrate the effectiveness of our approach. We believe this approach contributes to both areas of image manipulation and counterfactual explainability of CNNs. The code is available at https://github.com/prclibo/ice.
