2002 IEEE International Conference on Data Mining, 2002. Proceedings.
DOI: 10.1109/icdm.2002.1183895
|View full text |Cite
|
Sign up to set email alerts
|

Iterative clustering of high dimensional text data augmented by local search

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
64
0

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 78 publications
(64 citation statements)
references
References 7 publications
0
64
0
Order By: Relevance
“…Fortunately, Gmeans was recently compared to CLUTO in the benchmarking study of Tunali,Çamurcu, and Bilgin (2010) who provide modified source code via http://www.dataminingresearch.com/index.php/2010/ 06/gmeans-clustering-software-compatible-with-gcc-4/ which can be compiled using current versions of GCC. Gmeans uses the fixed-point algorithm combined with the first variation local improvement strategy of Dhillon et al (2002) (as also available for method "pclust") and provides a choice among six different initialization methods. By default, no first variations are performed, and the initial prototypes are chosen by first determining the spherical 1-means prototype i x i / x i , and then repeatedly picking the x i most dissimilar to the already chosen prototypes as the next prototype (i.e., by default prototypes are initialized in a deterministic way).…”
Section: Methods "Gmeans"mentioning
confidence: 99%
See 4 more Smart Citations
“…Fortunately, Gmeans was recently compared to CLUTO in the benchmarking study of Tunali,Çamurcu, and Bilgin (2010) who provide modified source code via http://www.dataminingresearch.com/index.php/2010/ 06/gmeans-clustering-software-compatible-with-gcc-4/ which can be compiled using current versions of GCC. Gmeans uses the fixed-point algorithm combined with the first variation local improvement strategy of Dhillon et al (2002) (as also available for method "pclust") and provides a choice among six different initialization methods. By default, no first variations are performed, and the initial prototypes are chosen by first determining the spherical 1-means prototype i x i / x i , and then repeatedly picking the x i most dissimilar to the already chosen prototypes as the next prototype (i.e., by default prototypes are initialized in a deterministic way).…”
Section: Methods "Gmeans"mentioning
confidence: 99%
“…In general, the corresponding computations may be prohibitively expensive. For the standard spherical k-means problem, Dhillon, Guan, and Kogan (2002) note that the effect of moving the i-th object from its cluster j = c(i) to a different cluster l can be obtained as follows. Let Ψ(M ) = min P Φ(M, P ) be the minimal criterion value for fixed M .…”
Section: The Standard Spherical K-means Problemmentioning
confidence: 99%
See 3 more Smart Citations