2006
DOI: 10.1007/s10852-005-9022-1
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Rules: A Comparison of Partitioning and Hierarchical Clustering Algorithms

Abstract: Previous research has resulted in a number of different algorithms for rule discovery. Two approaches discussed here, the ‘all-rules’ algorithm and multi-objective metaheuristics, both result in the production of a large number of partial classification rules, or ‘nuggets’, for describing different subsets of the records in the class of interest. This paper describes the application of a number of different clustering algorithms to these rules, in order to identify similar rules and to better understand the da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
377
0
8

Year Published

2014
2014
2023
2023

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 514 publications
(387 citation statements)
references
References 8 publications
2
377
0
8
Order By: Relevance
“…These profiles were concatenated with the profiles of 1,267 samples distributed with the IGC. A Jensen-Shannon divergence matrix was calculated using the phyloseq 61 implementation over all profiles, and partitioning around medoids 62 was performed to obtain enterotype annotations for each sample.…”
Section: Methodsmentioning
confidence: 99%
“…These profiles were concatenated with the profiles of 1,267 samples distributed with the IGC. A Jensen-Shannon divergence matrix was calculated using the phyloseq 61 implementation over all profiles, and partitioning around medoids 62 was performed to obtain enterotype annotations for each sample.…”
Section: Methodsmentioning
confidence: 99%
“…In accordance with this, an initial clustering analysis is conducted by an unsupervised learning machine, Partition Around Mediods (PAM; [25]; R package cluster). One advantage of this partition clustering technique relies on the fact that it can work with different similarity measurements other than the Euclidean distance.…”
Section: Classification Processmentioning
confidence: 99%
“…The expression of each gene was split into 'high' and 'low' based on Partition Around Medoids clustering. 21,22 The performance of a gene was evaluated using a REM comparing the 10-year metastasis rate in high versus low expression also using the R 'meta' package. Pooled multivariate logistic regression analysis was performed, with age analyzed per year, Gleason and PSA stratified into high/low (Gleason 8-10/ ⩽ 7, PSA48/ ⩽ 8), and stratification by cohort as described previously 18 to account for baseline differences, both measured and unmeasured, between cohorts.…”
Section: Univariate and Multivariate Analysesmentioning
confidence: 99%