Clustering is the process of grouping similar data into a set of clusters. Cluster analysis is one of the major data analysis techniques, and k-means is one of the most widely used partitioning clustering algorithms. However, the original k-means algorithm is computationally expensive, and the resulting set of clusters depends strongly on the selection of the initial centroids. Several methods have been proposed to improve the performance of the k-means clustering algorithm. In this paper we propose a heuristic method that finds better initial centroids and produces more accurate clusters in less computational time. Experimental results show that the proposed algorithm generates clusters with better accuracy and thereby improves the performance of the k-means clustering algorithm.
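The dependence on initial centroids can be illustrated with a minimal sketch of standard (Lloyd-style) k-means. This is not the heuristic proposed in the paper, which the abstract does not describe; the function names `kmeans` and `sse` and the four-point data set are assumptions chosen purely for illustration.

```python
from math import dist

def kmeans(points, centroids, max_iter=100):
    """Standard k-means started from caller-supplied initial centroids."""
    centroids = [tuple(c) for c in centroids]
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        updated = [
            tuple(sum(axis) / len(members) for axis in zip(*members))
            if members else old
            for members, old in zip(clusters, centroids)
        ]
        if updated == centroids:  # no centroid moved: converged
            break
        centroids = updated
    return centroids, clusters

def sse(centroids, clusters):
    """Sum of squared distances from each point to its cluster centroid."""
    return sum(dist(p, c) ** 2
               for c, members in zip(centroids, clusters)
               for p in members)

# Hypothetical toy data: the four corners of a 4-by-1 rectangle.
points = [(0, 0), (0, 1), (4, 0), (4, 1)]

# Two different initializations for k = 2.
good = kmeans(points, [(0, 0), (4, 0)])   # splits left / right
poor = kmeans(points, [(2, 0), (2, 1)])   # splits bottom / top

print("SSE with first initialization: ", sse(*good))
print("SSE with second initialization:", sse(*poor))
```

Both runs converge, but the second one stops at a partition with a much larger within-cluster error, which is the sensitivity to initialization that motivates methods for choosing better starting centroids.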