Abstract: A large number of clustering algorithms exist for either numeric or categorical data sets, but relatively few handle mixed attributes. This paper proposes Mutual Information based Weighted Clustering for Mixed Attributes (MI-WCMA), which combines Euclidean distance for numeric attributes, a similarity-based distance measure for categorical attributes using rough sets, and feature weights based on average mutual information. The metrics accuracy, silhouette width and kappa co-ef…
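The abstract names three ingredients: a Euclidean term for numeric attributes, a dissimilarity term for categorical attributes, and per-feature weights derived from average mutual information. The following is a minimal illustrative sketch of how such pieces could fit together, not the paper's MI-WCMA procedure. Several parts are assumptions made for illustration: the rough-set-based categorical measure is not specified in the abstract, so a simple mismatch count stands in for it; the weighting scheme (mean mutual information of a feature with every other feature, normalised to sum to 1) is one plausible reading of "average mutual information"; numeric columns are assumed to be discretised before weights are computed; and all function names are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # Sum over observed (x, y) pairs of p(x,y) * log(p(x,y) / (p(x) p(y))).
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def average_mi_weights(columns):
    """ASSUMED scheme: weight each discrete column by its mean MI with the
    other columns, then normalise so the weights sum to 1."""
    k = len(columns)
    if k < 2:
        return [1.0] * k
    raw = [
        sum(mutual_information(columns[i], columns[j])
            for j in range(k) if j != i) / (k - 1)
        for i in range(k)
    ]
    total = sum(raw)
    # Fall back to uniform weights when no column shares information.
    return [r / total for r in raw] if total > 0 else [1.0 / k] * k


def mixed_distance(a, b, numeric_idx, categorical_idx, weights):
    """Weighted distance between two records a and b.

    Numeric part: weighted Euclidean distance.
    Categorical part: weighted mismatch count, a STAND-IN for the
    rough-set-based measure, which the abstract does not define.
    """
    num = math.sqrt(sum(weights[i] * (a[i] - b[i]) ** 2 for i in numeric_idx))
    cat = sum(weights[i] * (a[i] != b[i]) for i in categorical_idx)
    return num + cat


if __name__ == "__main__":
    cols = [[0, 0, 1, 1], [0, 0, 1, 1], [0, 1, 0, 1]]
    print(average_mi_weights(cols))
    print(mixed_distance((1.0, "a"), (4.0, "b"), [0], [1], [1.0, 1.0]))
```

Adding the numeric and categorical parts directly is itself a design choice made here for brevity; in practice the numeric attributes would usually be scaled first so that neither part dominates the distance.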