In this paper is explored a way to reduce the rate of reclustering and speed up the clustering process on categorical time-evolving data. This method introduces two algorithms RDE (Replicated Data Elimination) and RCRDE. The RDE algorithm removes the successive surveys of replicated data and considers counters to keep this data. Hence the number of created windows via the sliding window technique is limited and this leads to decrease the number of implementations of clustering algorithm. The RCRDE algorithm based on MARDL (MAximal Resemblance Data Labeling) framework decides about re-clustering implementation or modification of previous clustering results. The presented method is independent of clustering algorithm's type and any kind of categorical clustering algorithm can be used. According to the results obtained on different data sets, this method performs well in practice and facilitates the clustering implementation on categorical data. Also, this method can be utilized to cluster a very large categorical static database with higher quality than previous work.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.