2021
DOI: 10.48550/arxiv.2110.03109
Preprint

Consistent Counterfactuals for Deep Models

Emily Black,
Zifan Wang,
Matt Fredrikson
et al.

Abstract: Counterfactual examples are one of the most commonly-cited methods for explaining the predictions of machine learning models in key areas such as finance and medical diagnosis. Counterfactuals are often discussed under the assumption that the model on which they will be used is static, but in deployment models may be periodically retrained or fine-tuned. This paper studies the consistency of model prediction on counterfactual examples in deep networks under small changes to initial training conditions, such as…
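The consistency question the abstract raises can be illustrated with a minimal sketch (this is not the paper's method, and every name here is hypothetical): train two models that differ only in their random seed, generate a simple counterfactual for one, and check whether the other model agrees on it.

```python
# Sketch: does a counterfactual found for model_a survive a "retraining"
# (model_b) that differs only in its initialization seed?
import numpy as np
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=500, n_features=5, random_state=0)

# Two trainings: identical data and hyperparameters, different init seeds.
model_a = MLPClassifier(hidden_layer_sizes=(16,), max_iter=500,
                        random_state=1).fit(X, y)
model_b = MLPClassifier(hidden_layer_sizes=(16,), max_iter=500,
                        random_state=2).fit(X, y)

def simple_counterfactual(model, x, step=0.05, max_steps=400):
    """Greedy line search toward the nearest training point of the opposite
    class until the prediction flips -- a stand-in for a real
    counterfactual generator, not the paper's algorithm."""
    target = 1 - model.predict(x.reshape(1, -1))[0]
    opposite = X[y == target]
    nearest = opposite[np.argmin(np.linalg.norm(opposite - x, axis=1))]
    cf = x.copy()
    for _ in range(max_steps):
        if model.predict(cf.reshape(1, -1))[0] == target:
            break
        cf = cf + step * (nearest - cf)
    return cf

x = X[0]
cf = simple_counterfactual(model_a, x)
consistent = (model_a.predict(cf.reshape(1, -1))[0]
              == model_b.predict(cf.reshape(1, -1))[0])
print("consistent across retraining:", consistent)
```

When `consistent` is False, the recourse offered to a user under `model_a` would silently stop working after the retraining that produced `model_b`, which is the failure mode the paper investigates.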

Cited by 3 publications (6 citation statements)
References 28 publications
“…However, this work focuses on small changes to the model, e.g., retraining on some data drawn from the same distribution, or minor changes to the hyperparameters, keeping the underlying data mostly similar. Such small changes to the model are in fact quite common in several applications and occur frequently in practice [12][13][14][15].…”
Section: Methods
confidence: 99%
“…In [6,10,18], the authors argue that counterfactuals that lie on the data manifold are likely to be more robust than the closest counterfactuals, but the focus is more on generating counterfactuals that specifically lie on the data manifold (which may not always be sufficient for robustness). Despite researchers arguing that robustness is an important desideratum of local explanation methods [13], the problem of generating robust counterfactuals has been less explored, with the notable exceptions of some recent works [12,14,22]. In [12,14], the authors propose algorithms that aim to find the closest counterfactuals that are also robust (with demonstration on linear models and neural networks).…”
Section: Methods
confidence: 99%
“…Prior works have focused on determining the extent to which recourses remain robust to the choice of the underlying model (Pawelczyk et al, 2020b;Black et al, 2021), shifts or changes in the underlying models (Rawal et al, 2021;Upadhyay et al, 2021), or small perturbations to the input instances (Artelt et al, 2021;Dominguez-Olmedo et al, 2021;Slack et al, 2021).…”
Section: Robustness of Algorithmic Recourse
confidence: 99%
“…lead to favourable classification outcomes) for all plausible individuals similar to the individual seeking recourse. We refer to this notion of robustness as the adversarial robustness of recourse, in order to distinguish it from other robustness considerations previously studied in the recourse literature (e.g., robustness with respect to changes to the decision-making classifier [29,36,3]), and as a reference to the adversarial robustness literature, which considers robustness of prediction precisely against uncertainty in the features of the data.…”
Section: Introduction
confidence: 99%
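The "adversarial robustness of recourse" notion quoted above can be sketched as a sampling check (a hedged illustration, not the cited authors' procedure; the model, epsilon, and recourse point are all assumptions): a recourse point is robust if nearby plausible individuals also receive the favourable outcome.

```python
# Sketch: estimate how often small perturbations of a recourse point
# still receive the favourable classification.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 4))
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # favourable class: 1
clf = LogisticRegression().fit(X, y)

def recourse_robustness(clf, x_cf, eps=0.1, n_samples=1000, favourable=1):
    """Fraction of uniformly sampled points in the eps-box around x_cf
    that the classifier still labels favourably."""
    noise = rng.uniform(-eps, eps, size=(n_samples, x_cf.shape[0]))
    preds = clf.predict(x_cf + noise)
    return float(np.mean(preds == favourable))

x_cf = np.array([1.0, 1.0, 0.0, 0.0])     # a hypothetical recourse point
score = recourse_robustness(clf, x_cf)
print(f"robust on {score:.0%} of sampled neighbours")
```

A score below 1.0 means some individuals similar to the one seeking recourse would be denied the favourable outcome, which is exactly the uncertainty-in-features concern the quoted passage borrows from the adversarial robustness literature.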