2014
DOI: 10.1109/tfuzz.2013.2286993
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating Fuzzy-C Means Using an Estimated Subsample Size

Abstract: Many algorithms designed to accelerate the Fuzzy c-Means (FCM) clustering algorithm randomly sample the data. Typically, no statistical method is used to estimate the subsample size, despite the impact subsample sizes have on speed and quality. This paper introduces two new accelerated algorithms, GOFCM and MSERFCM, that use a statistical method to estimate the subsample size. GOFCM, a variant of SPFCM, also leverages progressive sampling. MSERFCM, a variant of rseFCM, gains a speedup from improved initializat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
28
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 80 publications
(28 citation statements)
references
References 47 publications
0
28
0
Order By: Relevance
“…[29][30][31][32] The advantages of FCM over k-means or any other clustering algorithm is that an object can lie under any number of clusters with or without overlap and its intent of belongingness can be estimated according to the following constraints with n, and c being number of data samples and clusters, respectively: [29][30][31][32] The advantages of FCM over k-means or any other clustering algorithm is that an object can lie under any number of clusters with or without overlap and its intent of belongingness can be estimated according to the following constraints with n, and c being number of data samples and clusters, respectively:…”
Section: Fuzzy C Meansmentioning
confidence: 99%
See 1 more Smart Citation
“…[29][30][31][32] The advantages of FCM over k-means or any other clustering algorithm is that an object can lie under any number of clusters with or without overlap and its intent of belongingness can be estimated according to the following constraints with n, and c being number of data samples and clusters, respectively: [29][30][31][32] The advantages of FCM over k-means or any other clustering algorithm is that an object can lie under any number of clusters with or without overlap and its intent of belongingness can be estimated according to the following constraints with n, and c being number of data samples and clusters, respectively:…”
Section: Fuzzy C Meansmentioning
confidence: 99%
“…FCM is one among the popular clustering algorithm, which samples the data without following the statistical approach for determining the size of the subsample. [29][30][31][32] The advantages of FCM over k-means or any other clustering algorithm is that an object can lie under any number of clusters with or without overlap and its intent of belongingness can be estimated according to the following constraints with n, and c being number of data samples and clusters, respectively:…”
Section: Fuzzy C Meansmentioning
confidence: 99%
“…Since FCM choose the initial centers randomly, the final result and especially its convergence speed significantly depends on the original center selection. A method proposed to address this problem is based on estimated subsample size to improve the initialization [18]. In the field of clustering large amounts of data, three types of methods have been proposed:…”
Section: Related Workmentioning
confidence: 99%
“…First is the distance measure strategy and second is initial centroids selection strategy to minimize processing speed and increase stability. Paper [4] introduces two accelerated clustering algorithms using estimated subsample size and the novel stopping criterion. Authors in the paper [5] present a systematic study of kmeans-based consensus clustering algorithm, identify necessary and sufficient conditions for the algorithms on both pure and noisy datasets.…”
Section: Related Workmentioning
confidence: 99%
“…The authors in the paper [12] propose two novel enhanced algorithms such as geometric progressive fuzzy c-means and minimum sample estimate random fuzzy c-means by using some statistical techniques. This is to compute the size subsamples.…”
Section: Related Workmentioning
confidence: 99%