Abstract:Data mining techniques are used to extract useful patterns from a large data set. k-mean algorithm is one of the most famous partitioning clustering algorithm. But, Euclidean distance is sensitive to outliers and is suitable to only numeric values. Real time datasets have mixed attribute values, missing values and measurements are not in the standard format.The proposed algorithm extends the ability of the kmean algorithm to use a mixed simil arity measure to find the similarity between data objects for cluste… Show more
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.