2021
DOI: 10.48550/arxiv.2112.05653
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Interpretable Clustering via Multi-Polytope Machines

Abstract: Clustering is a popular unsupervised learning tool often used to discover groups within a larger population such as customer segments, or patient subtypes. However, despite its use as a tool for subgroup discovery and description -few stateof-the-art algorithms provide any rationale or description behind the clusters found. We propose a novel approach for interpretable clustering that both clusters data points and constructs polytopes around the discovered clusters to explain them. Our framework allows for add… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…While this may seem similar to polyhedral cluster descriptions it is important to note that the objective is the k-means clustering cost, not explicitly interpretability, and operates only with a k-means reference clustering not a general clustering. Most similar to our work is the use of multi-polytope machines to perform the clustering [24]. However, our approach differs from this line of work as the cluster assignments are fixed in the cluster description problem, and the aim is optimize interpretability not the quality of the clustering itself.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…While this may seem similar to polyhedral cluster descriptions it is important to note that the objective is the k-means clustering cost, not explicitly interpretability, and operates only with a k-means reference clustering not a general clustering. Most similar to our work is the use of multi-polytope machines to perform the clustering [24]. However, our approach differs from this line of work as the cluster assignments are fixed in the cluster description problem, and the aim is optimize interpretability not the quality of the clustering itself.…”
Section: Related Workmentioning
confidence: 99%
“…The first is to put a constraint on the complexity of the polyhedral description. Similar to previous work on rule sets [24], we define complexity of a half-space as the number of non-zero terms in the half-space plus one, and the complexity of the polyhedron as the sum of the complexities of the half-spaces that compose it. We call this variant of the PDP with complexity constraint, the Low-Complexity PDP (LC-PDP).…”
Section: Problem Formulationmentioning
confidence: 99%
See 2 more Smart Citations