Clustering is one of the critical parts of machine learning algorithms. K-Means clustering is the standard technique that various data analysts use for clustering the data among the various clusters. Even though the K means clustering algorithm can work effectively, there is a need to tune the value of K according to the dataset under consideration. The process of tuning for the value of k requires the execution of the Kmeans algorithm with different values of k. The values of k with the best cluster quality based on specific metrics are selected. The elbow method and silhouette coefficient is the most popular approach for selecting the number of clusters. However, both approaches are time-consuming, as they have to execute K-means for each value of k to find a good score. This approach is iterative. This paper proposed a divide and conquer approach that performs the same task in less time. From the proposed approach, the task has been completely 2.5 times faster in comparison to the iterative method at the cost of some memory required to store the results.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.