2021
DOI: 10.48550/arxiv.2103.15965
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Strong Optimal Classification Trees

Abstract: Decision trees are among the most popular machine learning models and are used routinely in applications ranging from revenue management and medicine to bioinformatics. In this paper, we consider the problem of learning optimal binary classification trees. Literature on the topic has burgeoned in recent years, motivated both by the empirical suboptimality of heuristic approaches and the tremendous improvements in mixedinteger optimization (MIO) technology. Yet, existing MIO-based approaches from the literature… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(14 citation statements)
references
References 43 publications
0
14
0
Order By: Relevance
“…Building upon SOCTs [26,27], the decision tree problem is formulated in a max-flow based model for computational efficiency.…”
Section: Improved Strong Optimal Classification Trees (Isocts)mentioning
confidence: 99%
See 3 more Smart Citations
“…Building upon SOCTs [26,27], the decision tree problem is formulated in a max-flow based model for computational efficiency.…”
Section: Improved Strong Optimal Classification Trees (Isocts)mentioning
confidence: 99%
“…The SOCT model [26,27], especially its branching constraints, is constructed for binary features. These constraints are as follows:…”
Section: Branching Threshold Constraints For Continuous Featuresmentioning
confidence: 99%
See 2 more Smart Citations
“…Recently, many works have directly optimized a performance metric (e.g., accuracy) with soft or hard sparsity constraints on the tree size. Such decision tree optimization problems can be formulated using mixed integer programming (MIP) (Bertsimas and Dunn 2017;Verwer and Zhang 2019;Vilas Boas et al 2021;Günlük et al 2021;Rudin and Ertekin 2018;Aghaei, Gómez, and Vayanos 2021). Other approaches use SAT solvers to find optimal decision trees (Narodytska et al 2018;Hu et al 2020), though these techniques require data to be perfectly separable, which is not typical for machine learning.…”
Section: Related Workmentioning
confidence: 99%