2020
DOI: 10.48550/arxiv.2011.03623
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Feature Removal Is a Unifying Principle for Model Explanation Methods

Abstract: Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. We examine the literature and find that many methods are based on a shared principle of explaining by removing-essentially, measuring the impact of removing sets of features from a model. These methods vary in several respects, so we develop a framework for removal-based explanations that characterizes each method along three dimensions: 1) h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(15 citation statements)
references
References 35 publications
0
15
0
Order By: Relevance
“…As illustration, both SHAP [70] and L2X [20] assign relevance scores to single features, however, while SHAP expresses relevance in terms of marginalized contributions of features across all possible subsets, L2X encodes relevance as a notion of informativeness on the response variable through maximizing mutual information. We refer to [12,26,104] for surveys and further details on selection methods.…”
Section: Feature-based Explanationsmentioning
confidence: 99%
“…As illustration, both SHAP [70] and L2X [20] assign relevance scores to single features, however, while SHAP expresses relevance in terms of marginalized contributions of features across all possible subsets, L2X encodes relevance as a notion of informativeness on the response variable through maximizing mutual information. We refer to [12,26,104] for surveys and further details on selection methods.…”
Section: Feature-based Explanationsmentioning
confidence: 99%
“…In contrast, EP is a representative for perturbation-based methods [7,18,26]. EP searches for a set of a% of input pixels that maximizes the target-class confidence score.…”
Section: Gradcam and Extremal Perturbation (Ep)mentioning
confidence: 99%
“…Since 2013 [56], hundreds of research papers have either used attribution methods or proposed new ones [18,19]. Yet, it remains largely unknown how effective state-of-the-art AMs are in improving the performance of human-AI team on computer vision tasks.…”
Section: Introductionmentioning
confidence: 99%
“…causing black/grey dots in images). Blurring the input image only removed high-frequency signals, but did not remove low-frequency signals [3,27]. To this end, instead of setting specific baseline values, Covert et al [4] determined baseline values conditionally depending on the neighboring contexts.…”
Section: Introductionmentioning
confidence: 99%