Tae-Chang Jee scite author profile

In this paper we proposed a fast method for a K-Means Clustering algorithm. The main characteristic of this method is that it uses precalculated data which possibility of change is high in order to speed up the algorithm. When calculating distance to cluster centre at each stage to assign nearest prototype in the clustering algorithm, it could reduce overall computation time by selecting only those data with possibility of change in cluster is high.Calculation time is reduced by using the distance information produced by K-Means algorithm when computing expected input data whose cluster may change, and by using such distance information the algorithm could be less affected by the number of dimensions. The proposed method was compared with original K-Means method -Lloyd's and the improved method KMHybrid. We show that our proposed method significantly outperforms in computation speed than Lloyd's and KMHybrid when using large size data which has large amount of data, great many dimensions and large number of clusters.■ keyword :|Pattern Recognition|Data Mining|K-Means Clustering|

show abstract

Social Networks Analysis using External Community Relationship

Lee¹,

Jee²

2011

Journal of Digital Contents Society

View full text Add to dashboard Cite

Visualization Method of Document Retrieval Result based on Centers of Clusters

Jee¹,

Lee²,

Lee³

2007

The Journal of the Korea Contents Association

View full text Add to dashboard Cite

A Study on Optimizing the Number of Clusters using External Cluster Relationship Criterion

Lee¹,

Jee²

2011

Journal of Digital Contents Society

View full text Add to dashboard Cite

The k-means has been one of the popular, simple and faster clustering algorithms, but the right value of k is unknown. The value of k (the number of clusters) is a very important element because the result of clustering is different depending on it. In this paper, we present a novel algorithm based on an external cluster relationship criterion which is an evaluation metric of clustering result to determine the number of clusters dynamically. Experimental results show that our algorithm is superior to other methods in terms of the accuracy of the number of clusters.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tae-Chang Jee

Error correction of Korean courtesy amounts in bank slips using rule information and cross-referencing

Fast K-Means Clustering Algorithm using Prediction Data

Social Networks Analysis using External Community Relationship

Visualization Method of Document Retrieval Result based on Centers of Clusters

A Study on Optimizing the Number of Clusters using External Cluster Relationship Criterion

Contact Info

Product

Resources

About