2010
DOI: 10.1002/asi.21311
|View full text |Cite
|
Sign up to set email alerts
|

High‐speed rough clustering for very large document collections

Abstract: Document clustering is an important tool, but itis not yet widely used in practice probably because of its high computational complexity. This article explores techniques of high-speed rough clustering of documents, assuming that it is sometimes necessary to obtain a clustering result in a shorter time, although the result is just an approximate outline of document clusters. A promising approach for such clustering is to reduce the number of documents to be checked for generating cluster vectors in the leader-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
2

Year Published

2011
2011
2017
2017

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 23 publications
0
6
0
2
Order By: Relevance
“…Atanassov [21,22,25] , Atanassov and Gargov [23] and Xu [314] . [145] Herawan et al [125] In the literature, there has been a lively debate by some scholars on the appropriateness of the term intuitionistic fuzzy sets (IFS) adopted by Atanassov for his theory. Grzegorzewski and Mròwka [116] suggested a list of possible alternative terms that would however allow one to retain the acronym IFS: incomplete fuzzy sets, inaccurate fuzzy sets, imperfect fuzzy sets, indefinite fuzzy sets, indeterminate fuzzy sets, indistinct fuzzy sets.…”
Section: Intuitionistic Fuzzy Clusteringmentioning
confidence: 99%
“…Atanassov [21,22,25] , Atanassov and Gargov [23] and Xu [314] . [145] Herawan et al [125] In the literature, there has been a lively debate by some scholars on the appropriateness of the term intuitionistic fuzzy sets (IFS) adopted by Atanassov for his theory. Grzegorzewski and Mròwka [116] suggested a list of possible alternative terms that would however allow one to retain the acronym IFS: incomplete fuzzy sets, inaccurate fuzzy sets, imperfect fuzzy sets, indefinite fuzzy sets, indeterminate fuzzy sets, indistinct fuzzy sets.…”
Section: Intuitionistic Fuzzy Clusteringmentioning
confidence: 99%
“…For example, the K ‐means clustering algorithm takes a long time to compute, requires a priori knowledge regarding the number of clusters, and is sensitive to the choice of initial cluster centers. Several algorithms have been proposed to reduce computation time in cluster analysis by using parallel computation techniques or at the cost of the clustering results (Fernandez & Gomez, ; Kishida, ). Recently, we developed a minimum span clustering (MSC) method for clustering complex networks.…”
Section: Related Workmentioning
confidence: 99%
“…Moreover, HC involves the calculation of similarities between all pairs of components for each clustering level, and is not efficient for handling large data sets. Several algorithms have been proposed for reducing computation time in cluster analysis by using parallel computation techniques or at the cost of the clustering results [2,10,11]. However, most clustering algorithms still require pre-given assumptions as inputs for the classification of complex systems, including the number of clusters, cluster sizes and boundary conditions.…”
Section: Introductionmentioning
confidence: 99%