Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data 1998
DOI: 10.1145/276304.276314
|View full text |Cite
|
Sign up to set email alerts
|

Automatic subspace clustering of high dimensional data for data mining applications

Abstract: Abstract. Data mining applications place special requirements on clustering algorithms including: the ability to find clusters embedded in subspaces of high dimensional data, scalability, end-user comprehensibility of the results, non-presumption of any canonical data distribution, and insensitivity to the order of input records. We present CLIQUE, a clustering algorithm that satisfies each of these requirements. CLIQUE identifies dense clusters in subspaces of maximum dimensionality. It generates cluster desc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
683
1
9

Year Published

2002
2002
2015
2015

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 1,569 publications
(693 citation statements)
references
References 25 publications
0
683
1
9
Order By: Relevance
“…Subspace clustering has been studied extensively on scalable methods for mining various kinds of subspace clusters, such as grid-based (Agrawal et al 1998), entropy-based (Cheng et al 1999), and density-based (Baumgartner et al 2004;Bohm et al 2004;Kriegel et al 2005;Assent et al 2007) approaches. The above approaches may suffer from the problem of a sheer size of redundancy results.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Subspace clustering has been studied extensively on scalable methods for mining various kinds of subspace clusters, such as grid-based (Agrawal et al 1998), entropy-based (Cheng et al 1999), and density-based (Baumgartner et al 2004;Bohm et al 2004;Kriegel et al 2005;Assent et al 2007) approaches. The above approaches may suffer from the problem of a sheer size of redundancy results.…”
Section: Related Workmentioning
confidence: 99%
“…Condition 3 is denoted as Density Condition which is also used in CLIQUE (Agrawal et al 1998) for identifying dense units. Conceptually, this definition is similar to those in grid-based clustering and density-based clustering.…”
Section: Problem Statementmentioning
confidence: 99%
“…A lot of clustering algorithms have been proposed in the literature [1,2]: BIRCH [42], for example, clusters the data with a single scan of the space, since the assignment is based on some local characteristics, without considering the other clusters. An important issue for the problem we deal with is to discover data clusters that follow a Gaussian distribution in a multi-dimensional space.…”
Section: The Clustering Proceduresmentioning
confidence: 99%
“…Cells which having more than specified numbers of points are called dense cells. The dense cells are connected to form clusters [13], some of which are: STING [16] and CLIQUE [17].…”
Section: Introductionmentioning
confidence: 99%