2019
DOI: 10.21105/joss.02027
|View full text |Cite
|
Sign up to set email alerts
|

shapr: An R-package for explaining machine learning models with dependence-aware Shapley values

Abstract: A common task within machine learning is to train a model to predict an unknown outcome (response variable) based on a set of known input variables/features. When using such models for real life applications, it is often crucial to understand why a certain set of features lead to a specific prediction. Most machine learning models are, however, complicated and hard to understand, so that they are often viewed as "black-boxes", that produce some output from some input.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1
1

Relationship

2
7

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 7 publications
0
15
0
Order By: Relevance
“…On top of already published packages, such as shapper (March 2019) and fastshap (November 2019) there are new, recently created tools that await their publication on CRAN. shapr (Sellereite and Jullum, 2019), treeshap (2020), SHAPforxgboost (2020 are examples of such new packages. Not to mention about other Predict Parts Explanation Methods like iBreakDown.…”
Section: Discussionmentioning
confidence: 99%
“…On top of already published packages, such as shapper (March 2019) and fastshap (November 2019) there are new, recently created tools that await their publication on CRAN. shapr (Sellereite and Jullum, 2019), treeshap (2020), SHAPforxgboost (2020 are examples of such new packages. Not to mention about other Predict Parts Explanation Methods like iBreakDown.…”
Section: Discussionmentioning
confidence: 99%
“…For (2), it would likely be fruitful to explore alternative approaches based on Shapley additive explanations (SHAP; Lundberg & Lee, 2017). SHAP is a game theoretic method for explaining fitted classifiers' predictions and has several extensions that help prevent its performance from degrading in the presence of multicollinearity (Aas, Jullum, & Løland, 2021;Basu & Maji, 2020;Sellereite & Jullum, 2020). We note, however, that SHAP is less computationally efficient than PI, potentially hampering its application to very large-scale data.…”
Section: Discussionmentioning
confidence: 99%
“…For the approaches presented in this paper, we have fitted both a non-parametric and a parametric vine. The independence, empirical, Gaussian and Gaussian copula approaches are all implemented in the R package shapr [30], and the plan is to also include the approaches proposed in this paper. The simulation model is detailed in Section 5.1, the actual design of the experiments is given in Section 5.2.…”
Section: Simulation Studiesmentioning
confidence: 99%