2017 IEEE International Conference on Data Mining (ICDM) 2017
DOI: 10.1109/icdm.2017.127
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Computation of Multiple Density-Based Clustering Hierarchies

Abstract: Abstract-HDBSCAN*, a state-of-the-art density-based hierarchical clustering method, produces a hierarchical organization of clusters in a dataset w.r.t. a parameter mpts. While the performance of HDBSCAN* is robust w.r.t. mpts in the sense that a small change in mpts typically leads to only a small or no change in the clustering structure, choosing a "good" mpts value can be challenging: depending on the data distribution, a high or low value for mpts may be more appropriate, and certain data clusters may reve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
2

Relationship

3
5

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 17 publications
0
8
0
Order By: Relevance
“…Moreover, this RNG has typically much fewer edges than the complete graph so its construction cost is more than outweighed by the reduction in edge weight computations. This paper is an extension of [29]. We add here all the proofs that guarantee the correctness of our proposed approach.…”
Section: Introductionmentioning
confidence: 94%
“…Moreover, this RNG has typically much fewer edges than the complete graph so its construction cost is more than outweighed by the reduction in edge weight computations. This paper is an extension of [29]. We add here all the proofs that guarantee the correctness of our proposed approach.…”
Section: Introductionmentioning
confidence: 94%
“…Beyond the minimum cluster size (MinPts), which is much easier to choose than Eps, the method requires no further setting of arbitrary or biasing parameters by a user [28]. Though HDBSCAN is claimed to be robust [29], the algorithm to build the hierarchy runs in quadratic time, in both the worst and the best case [30].…”
Section: Research Backgroundmentioning
confidence: 99%
“…While traditional clustering techniques have provided many efficient schemes to detect irregularly shaped clusters (e.g., DBSCAN [52], OPTICS [10], spectral clustering [178], Chameleon [84], HDBSCAN [20,135], etc. ), these methods, as introduced earlier, do not incorporate statistical rigor required by domain applications of spatial hotspot detection (Sec.…”
Section: Enumerationmentioning
confidence: 99%