1998
DOI: 10.1145/276305.276314
|View full text |Cite
|
Sign up to set email alerts
|

Automatic subspace clustering of high dimensional data for data mining applications

Abstract: Abstract. Data mining applications place special requirements on clustering algorithms including: the ability to find clusters embedded in subspaces of high dimensional data, scalability, end-user comprehensibility of the results, non-presumption of any canonical data distribution, and insensitivity to the order of input records. We present CLIQUE, a clustering algorithm that satisfies each of these requirements. CLIQUE identifies dense clusters in subspaces of maximum dimensionality. It generates cluster desc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
608
0
17

Year Published

2001
2001
2017
2017

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 830 publications
(626 citation statements)
references
References 33 publications
1
608
0
17
Order By: Relevance
“…The problem of co-clustering is also closely related to the problem of subspace clustering [7] or projected clustering [5] in quantitative data in the database literature. In this problem, the data is clustered by simultaneously associating it with a set of points and subspaces in multi-dimensional space.…”
Section: Co-clustering Words and Documentsmentioning
confidence: 99%
“…The problem of co-clustering is also closely related to the problem of subspace clustering [7] or projected clustering [5] in quantitative data in the database literature. In this problem, the data is clustered by simultaneously associating it with a set of points and subspaces in multi-dimensional space.…”
Section: Co-clustering Words and Documentsmentioning
confidence: 99%
“…In [3], they define clusters in euclidean space by DNF formulas and address performance issues for data mining applications. In [87], the drawbacks of random sampling in clustering algorithms (e.g., small clusters might be missed) are avoided by density biased sampling.…”
Section: Bibliographical Notesmentioning
confidence: 99%
“…Contiguous dense cells are connected to form clusters. Examples of grid-based clustering methods include STING [15] and CLIQUE [16].…”
Section: Categorization Of Clustering Algorithmsmentioning
confidence: 99%