2020
DOI: 10.48550/arxiv.2011.14878
Preprint

Explaining by Removing: A Unified Framework for Model Explanation

Abstract: Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. We establish a new class of methods, removal-based explanations, that are based on the principle of simulating feature removal to quantify each feature's influence. These methods vary in several respects, so we develop a framework that characterizes each method along three dimensions: 1) how the method removes features, 2) what model behavior…

Cited by 21 publications (39 citation statements)
References 62 publications
“…where the held out features $x_{1-s}$ are marginalized out using their joint marginal distribution $p(x_{1-s})$ and a link function (e.g., logit) is applied to the model output. Recent work has debated the properties of different value function formulations, particularly the choice of how to remove features [1,19,9,14]. Regardless of the formulation, this approach to model explanation enjoys several useful theoretical properties due to the use of Shapley values: for example, they are zero for irrelevant features and are guaranteed to sum to the model's prediction.…”
Section: Shapley Values (mentioning)
confidence: 99%
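
As a concrete illustration of the value function described in this passage, the sketch below marginalizes held-out features using rows drawn from a background dataset (standing in for the joint marginal $p(x_{1-s})$) and applies a logit link to the averaged model output. The function and argument names are illustrative, not taken from the cited papers.

```python
import numpy as np

def marginal_value_function(model, x, subset, background,
                            link=lambda p: np.log(p / (1.0 - p))):
    """Sketch of a marginal-distribution value function (illustrative names).

    Held-out features are "removed" by replacing them with rows from a
    background dataset, approximating their joint marginal distribution;
    the model output is averaged over those draws and a link function
    (logit by default) is applied to the result.
    """
    held_out = [j for j in range(x.shape[0]) if j not in subset]
    samples = np.tile(x, (len(background), 1))       # one copy of x per background row
    samples[:, held_out] = background[:, held_out]   # overwrite held-out entries
    return link(np.mean(model(samples)))             # average, then apply the link
```

Here `model` is assumed to map a batch of inputs to predicted probabilities; the accuracy of the marginalization depends on how well the background sample represents the data distribution.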
“…First, many works have proposed stochastic estimators [3,40,39,27,10,44] that rely on sampling either feature subsets or permutations; these are often consistent estimators, but they require many model evaluations and involve a trade-off between run-time and accuracy. Second, some works have proposed model-specific approximations, e.g., for trees [26] or neural networks [35,6,2,43]; these are generally faster, but they sometimes require many model evaluations, often induce bias, and typically lack flexibility regarding how to handle held-out features when generating explanations, a subject of continued debate in the field [1,19,9,14].…”
Section: Introduction (mentioning)
confidence: 99%
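
The permutation-sampling estimators mentioned in this passage can be sketched as below. Here `value_fn` is assumed to behave like the marginal value function sketched earlier, and the estimator illustrates the run-time/accuracy trade-off: each sampled permutation costs one value evaluation per feature, and more permutations reduce variance.

```python
import numpy as np

def permutation_shapley(value_fn, num_features, num_permutations=128, seed=None):
    """Sketch of a permutation-sampling Shapley estimator (illustrative only).

    For each sampled permutation, features are added one at a time and each
    feature is credited with its marginal contribution to the value function;
    averaging over permutations gives a consistent but potentially expensive
    estimate of the Shapley values.
    """
    rng = np.random.default_rng(seed)
    phi = np.zeros(num_features)
    for _ in range(num_permutations):
        subset = set()
        prev = value_fn(subset)                  # value of the empty coalition
        for j in rng.permutation(num_features):
            subset.add(int(j))
            curr = value_fn(subset)
            phi[j] += curr - prev                # marginal contribution of feature j
            prev = curr
    return phi / num_permutations
```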
“…4. We remark that one of the most important details of any explanation method based on feature removal is the baseline, which defines the value that $X_C$ takes in the entries not in C. There are different approaches to removing features, ranging from using the default value of 0, to using their conditional distribution (refer to [9] for further details). Computing the latter can be challenging, and recent work has explored various approximations [1,11].…”
Section: Explaining Predictions Via Shapley Coefficients (mentioning)
confidence: 99%
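
The two ends of the spectrum mentioned here, a fixed zero baseline versus (an approximation of) the conditional distribution of the held-out features, might be sketched as follows. `conditional_sampler` is a hypothetical callable standing in for the learned approximations discussed in the passage; it is not a specific method from the cited works.

```python
import numpy as np

def value_with_zero_baseline(model, x, kept):
    """Remove features by setting held-out entries to a fixed baseline of 0."""
    x_removed = np.zeros_like(x, dtype=float)
    x_removed[list(kept)] = x[list(kept)]
    return float(model(x_removed[None, :])[0])

def value_with_conditional(model, x, kept, conditional_sampler, num_samples=64):
    """Remove features by averaging over draws from p(x_heldout | x_kept).

    `conditional_sampler(x_kept_values, kept, held_out)` is assumed to return
    plausible values for the held-out entries; learning such a sampler is the
    hard part, which the approximations cited in the passage address.
    """
    held_out = np.setdiff1d(np.arange(x.shape[0]), list(kept))
    outputs = []
    for _ in range(num_samples):
        x_filled = np.array(x, dtype=float)
        x_filled[held_out] = conditional_sampler(x[list(kept)], kept, held_out)
        outputs.append(model(x_filled[None, :])[0])
    return float(np.mean(outputs))
```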
“…as baseline), h-Shap uses their expected value (or unconditional distribution [20]) for simplicity, as done by other works [9]. As pointed out by [9,27], this is valid under the assumptions of model linearity and feature independence. Yet, as we will argue later in Sec.…”
Section: Hierarchical-Shap (mentioning)
confidence: 99%
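
The claim that replacing held-out features by their expected values is valid under model linearity and feature independence can be checked directly. The short derivation below is a sketch under those two assumptions, writing $S$ for the kept features, $\bar{S}$ for the held-out ones, and $f(x) = w^\top x + b$ (this notation is ours, adapted from the indexing used in the quoted passages).

```latex
% Assume f(x) = w^\top x + b (linearity) and X_{\bar{S}} independent of X_S.
\mathbb{E}\!\left[f(x_S, X_{\bar{S}}) \mid X_S = x_S\right]
  = w_S^\top x_S + w_{\bar{S}}^\top \, \mathbb{E}\!\left[X_{\bar{S}} \mid X_S = x_S\right] + b
  = w_S^\top x_S + w_{\bar{S}}^\top \, \mathbb{E}\!\left[X_{\bar{S}}\right] + b
  = f\!\left(x_S, \mathbb{E}[X_{\bar{S}}]\right),
```

where the second equality drops the conditioning because of independence; without linearity or independence, the expected-value (single-baseline) substitution and the conditional-expectation formulation generally disagree.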