2020
DOI: 10.1007/s00357-019-09350-4
|View full text |Cite
|
Sign up to set email alerts
|

C443: a Methodology to See a Forest for the Trees

Abstract: Often tree-based accounts of statistical learning problems yield multiple decision trees which together constitute a forest. Reasons for this include examining tree instability, improving prediction accuracy, accounting for missingness in the data, and taking into account multiple outcome variables. A key disadvantage of forests, unlike individual decision trees, is their lack of transparency. Hence, an obvious challenge is whether it is possible to recover some of the insightfulness of individual trees from a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(16 citation statements)
references
References 31 publications
0
16
0
Order By: Relevance
“…RF and other classification tree ensembles, however, are often considered black-box techniques (similar to SVM). Recently, there have been advances in developing interpretational tools for RF and other black box techniques (Sies and Van Mechelen 2020;Ribeiro et al 2016). Others have adapted ensemble methods with trees to increase interpretability (Meinshausen 2010) and searched for optimal tree ensembles (Khan et al 2020(Khan et al , 2021.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…RF and other classification tree ensembles, however, are often considered black-box techniques (similar to SVM). Recently, there have been advances in developing interpretational tools for RF and other black box techniques (Sies and Van Mechelen 2020;Ribeiro et al 2016). Others have adapted ensemble methods with trees to increase interpretability (Meinshausen 2010) and searched for optimal tree ensembles (Khan et al 2020(Khan et al , 2021.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…This representation was proposed by [17], where a variety of possibilities was presented, and it consists of an adaptation of the approach from [27], that considered a binary representation. We prefer the former representation over a binary one for its ability to take into account multiple splits on the same feature.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…For this purpose we select from the related work Section 2 the methods that: 1) have a publicly available source code, and 2) provide a direct way to retrieve the predictions of the surrogate model. The selection narrows down to the C443 [17] method, which is limited to the binary classification set-up. The authors propose a similar approach to our work with their tree extraction algorithm but focus more on giving an overview of the possible approaches rather than optimising the performance of a surrogate model.…”
Section: Competing Methodsmentioning
confidence: 99%
See 2 more Smart Citations