2004
DOI: 10.1023/b:dami.0000015869.08323.b3
|View full text |Cite
|
Sign up to set email alerts
|

Fast and Robust General Purpose Clustering Algorithms

Abstract: General purpose and highly applicable clustering methods are usually required during the early stages of knowledge discovery exercises. k-Means has been adopted as the prototype of iterative model-based clustering because of its speed, simplicity and capability to work within the format of very large databases. However, k-Means has several disadvantages derived from its statistical simplicity. We propose an algorithm that remains very efficient, generally applicable, multidimensional but is more robust to nois… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
35
0

Year Published

2005
2005
2022
2022

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 68 publications
(35 citation statements)
references
References 34 publications
0
35
0
Order By: Relevance
“…PAM utilizes real data points (medoids) as the cluster prototypes and avoids the effect of outliers. Based on the same consideration, a -medoids algorithm is presented in [87] by searching the discrete 1-medians as the cluster centroids.…”
Section: )mentioning
confidence: 99%
“…PAM utilizes real data points (medoids) as the cluster prototypes and avoids the effect of outliers. Based on the same consideration, a -medoids algorithm is presented in [87] by searching the discrete 1-medians as the cluster centroids.…”
Section: )mentioning
confidence: 99%
“…In Ref. [37], a K-medoids algorithm is presented. In this cluster centroid is searched using the discrete 1-medians.…”
Section: Algorithmmentioning
confidence: 99%
“…In paper [29], [38] and [39] different variants of K-means algorithm has been used which can be applied to categorical data. The Proposed K-medoids [37] and K-modes algorithm operates in a similar way as K -means.…”
Section: Algorithmmentioning
confidence: 99%
“…On the other hand, clustering of pages will discover groups of pages having related content. Several researchers have applied data mining techniques to web server logs, attempting to unlock the usage patterns of web users hidden in the log files [13].…”
Section: Web User Clusteringmentioning
confidence: 99%