2012
DOI: 10.9790/0661-0354451
|View full text |Cite
|
Sign up to set email alerts
|

A Study in Employing Rough Set Based Approach for Clustering on Categorical Time-Evolving Data

Abstract: The proportionate increase in the size of the data with increase in space implies that clustering a very large data set becomes difficult and is a time consuming process. Sampling is one important technique to scale down the size of dataset and to improve the efficiency of clustering. After sampling, allocating unlabeled data point into proper cluster is difficult in the categorical domain and in real situations data changes over time. However, clustering this type of data not only decreases the quality of clu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…This approach resulted in an accuracy of 70.33% using the Random Forest classifier and 66.54% with the Support Vector Machine (SVM). These methods were used for product recommendations [6]. The study employed a user-focused clustering method, utilizing a red wine dataset for analysis.…”
Section: Introductionmentioning
confidence: 99%
“…This approach resulted in an accuracy of 70.33% using the Random Forest classifier and 66.54% with the Support Vector Machine (SVM). These methods were used for product recommendations [6]. The study employed a user-focused clustering method, utilizing a red wine dataset for analysis.…”
Section: Introductionmentioning
confidence: 99%
“…An algorithm MMR (Min-Min-Roughness) for clustering categorical based on rough set theory is proposed by Parmer, Wu and Blackhurst in 2007 [14]. RST is also deals with vagueness and uncertainty in data analysis [15,16,17]. Shannon's entropy [18,19] is used to measure the uncertainty information.…”
Section: Introductionmentioning
confidence: 99%
“…As seen earlier clustering of categorical data is a tough task, we have theory to decrease the risk of clustering. Rough Set Theory(RST) is used to cluster the categorical data, this is used in determining the class labels of unknown data [12].it is a mathematical tool which was applied in many areas like dimensionality reduction, machine learning and pattern reduction to enhance the end results.paramet.al proposed rough set based on MMR (min-minroughness) for clustering the categorical data [13].in this method rough member ship function states similarity with a unlabeled data point [14,15,16]. For analyzing uncertain information Shannon's entropy process is used between data sets [17,18].…”
Section: Introductionmentioning
confidence: 99%