2023
DOI: 10.48550/arxiv.2302.04791
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Symbolic Metamodels for Interpreting Black-boxes Using Primitive Functions

Abstract: One approach for interpreting black-box machine learning models is to find a global approximation of the model using simple interpretable functions, which is called a metamodel (a model of the model). Approximating the black-box with a metamodel can be used to 1) estimate instance-wise feature importance; 2) understand the functional form of the model; 3) analyze feature interactions. In this work, we propose a new method for finding interpretable metamodels. Our approach utilizes Kolmogorov superposition theo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…Amazon had been using automated software since 2014 to assess applicants' resumes, which were found to be biased against women (Dastin, 2018). As a result, there have been several works focusing on interpreting machine learning models to understand how features and sensitive attributes contribute to the output of the model (Ribeiro et al, 2016;Lundberg and Lee, 2017;Abroshan et al, 2023).…”
Section: Introductionmentioning
confidence: 99%
“…Amazon had been using automated software since 2014 to assess applicants' resumes, which were found to be biased against women (Dastin, 2018). As a result, there have been several works focusing on interpreting machine learning models to understand how features and sensitive attributes contribute to the output of the model (Ribeiro et al, 2016;Lundberg and Lee, 2017;Abroshan et al, 2023).…”
Section: Introductionmentioning
confidence: 99%