2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.00073
|View full text |Cite
|
Sign up to set email alerts
|

Explaining in Style: Training a GAN to explain a classifier in StyleSpace

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
38
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 92 publications
(38 citation statements)
references
References 18 publications
0
38
0
Order By: Relevance
“…The majority of work on interpretability so far has focused on (i), providing post-hoc explanations for a given prediction model. These include pixel attribution methods [Simonyan et al, 2014, Bach et al, 2015, Selvaraju et al, 2017, counterfactual explanations [Chang et al, 2019, Antoran et al, 2021, explanations based on pre-defined concepts , Kazhdan et al, 2020, Yeh et al, 2020, and recently developed StyleGANs [Wu et al, 2021, Lang et al, 2021. Post-hoc methods have a number of shortcomings given our desired objectives: First, it is unclear whether post-hoc explanations indeed reflect the black-box model's true "reasoning" [Rudin, 2018.…”
Section: Post-hoc Methodsmentioning
confidence: 99%
“…The majority of work on interpretability so far has focused on (i), providing post-hoc explanations for a given prediction model. These include pixel attribution methods [Simonyan et al, 2014, Bach et al, 2015, Selvaraju et al, 2017, counterfactual explanations [Chang et al, 2019, Antoran et al, 2021, explanations based on pre-defined concepts , Kazhdan et al, 2020, Yeh et al, 2020, and recently developed StyleGANs [Wu et al, 2021, Lang et al, 2021. Post-hoc methods have a number of shortcomings given our desired objectives: First, it is unclear whether post-hoc explanations indeed reflect the black-box model's true "reasoning" [Rudin, 2018.…”
Section: Post-hoc Methodsmentioning
confidence: 99%
“…This semantic map can then be used to perform local editing over an image, guided by a target reference image. Lang et al [2021] propose to not only exploit the emerging disentanglement properties of a pretrained StyleGAN, but to train a StyleGAN model for a specific disentangled axis. Through a clever training scheme, combining training StyleGAN along with a classifier for binary or multi-class recognition (e.g., a cat vs. dog classifier), they drive the latent space to capture classifier-specific attributes.…”
Section: Discriminative Applicationsmentioning
confidence: 99%
“…Related work focuses on understanding if neural networks encode and use concepts (Lucieri et al, 2020;Kim et al, 2018;McGrath et al, 2021), or generate counterfactual explanations to understand model behavior (Ghandeharioun et al, 2021;Abid et al, 2021;Akula et al, 2020). These works mostly use a set of human-specified concepts to analyze model behavior, however, there is an increasing interest in automatically discovering the concepts that are used by a model (Yeh et al, 2020;Ghorbani et al, 2019;Lang et al, 2021).…”
Section: Related Workmentioning
confidence: 99%