2021
DOI: 10.1016/j.softx.2020.100642
|View full text |Cite
|
Sign up to set email alerts
|

Fundamental clustering algorithms suite

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 37 publications
(29 citation statements)
references
References 42 publications
0
29
0
Order By: Relevance
“…Benchmarking will be performed on two high-dimensional datasets (D > 7000 and D > 18,000) and four artificially defined data structures with 41 clustering algorithms that are available in a previously published clustering suite 82 . The high-dimensional datasets possess one true partition of the data, which was verified by various methods and a domain expert.…”
Section: Methodsmentioning
confidence: 99%
“…Benchmarking will be performed on two high-dimensional datasets (D > 7000 and D > 18,000) and four artificially defined data structures with 41 clustering algorithms that are available in a previously published clustering suite 82 . The high-dimensional datasets possess one true partition of the data, which was verified by various methods and a domain expert.…”
Section: Methodsmentioning
confidence: 99%
“…Often, the user cannot manually change the distance metric (c.f. 54 common algorithms listed in [45]), resulting in the implicit application of the Euclidean metric if not otherwise specified. In contrast, projection-based clustering has the advantage that a specific distance metric can be selected by the user, which is then used in the dimensionality reduction and clustering part of the algorithm.…”
Section: Step I: Identification Of Structures In the Datamentioning
confidence: 99%
“…The definition of a cluster remains a matter of ongoing discussion [60,61]. Therefore, the justification for an appropriate choice for any one of the over fifty currently available open-source clustering algorithms [45] is proposed using data-driven criteria of Gaussian mixture models of distances, topographic maps, heatmaps, and Occam's razor (in the following Section 2.1.1, Section 2.1.3, Section 2.2.1, and Section 2.2.2). Additionally, a benchmarking of 34 clustering algorithms revealed for a large variety of distance and density-based cluster structures that projection-based clustering (PBC) is an appropriate choice [42].…”
Section: Step Ii: Cluster Analysis With Projection-based Clustering (mentioning
confidence: 99%
See 2 more Smart Citations