2020
DOI: 10.1007/978-3-030-58115-2_6
|View full text |Cite
|
Sign up to set email alerts
|

Learning a Formula of Interpretability to Learn Interpretable Formulas

Abstract: Many risk-sensitive applications require Machine Learning (ML) models to be interpretable. Attempts to obtain interpretable models typically rely on tuning, by trial-and-error, hyper-parameters of model complexity that are only loosely related to interpretability. We show that it is instead possible to take a meta-learning approach: an ML model of non-trivial Proxies of Human Interpretability (PHIs) can be learned from human feedback, then this model can be incorporated within an ML training process to directl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
26
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 23 publications
(30 citation statements)
references
References 46 publications
4
26
0
Order By: Relevance
“…Moreover, evolution prefers the ∧ and ¬ operators over the temporal operators (of which there is approximately one per formula). This finding is in line with our expectations since temporal operators are likely to be the least interpretable for a human (as confirmed for mathematical expressions in [23]). In the following, we transcribe some instances of best individuals.…”
Section: Rq2: Specifications That Are Readable and Interpretable For A Humansupporting
confidence: 92%
See 2 more Smart Citations
“…Moreover, evolution prefers the ∧ and ¬ operators over the temporal operators (of which there is approximately one per formula). This finding is in line with our expectations since temporal operators are likely to be the least interpretable for a human (as confirmed for mathematical expressions in [23]). In the following, we transcribe some instances of best individuals.…”
Section: Rq2: Specifications That Are Readable and Interpretable For A Humansupporting
confidence: 92%
“…However, we believe that posing a further limit on the composition of the temporal operators may make the STL formulas more readable, and not only just smaller. Our belief is corroborated by the findings of [23] for mathematical expressions: some operators, such as log and sin, make the expressions less interpretable than others, e.g., + and ÷.…”
Section: Grammar For Stl Formula Structuressupporting
confidence: 70%
See 1 more Smart Citation
“…Recently, an estimator of human-interpretability for symbolic models expressed as formulae was machine-learned from human feedback [58]. A survey was used to make users simulate the calculations of (random) formulae (an implementation of the XAI concept of simulatability [34]), and to identify the behavior of the formula when part of it would vary in some interval (an implementation of the XAI concept of decomposability [34]).…”
Section: Related Workmentioning
confidence: 99%
“…In this paper we build upon [58] and extend it in three ways: (1) we concurrently learn a model of intepretability that is specific to the user; (2) we use a more complex, non-linear estimator instead of a linear one; and (3) we exploit uncertainty estimation to require a small amount of feedback to train the estimator.…”
Section: Related Workmentioning
confidence: 99%