2007 IEEE Symposium on Computational Intelligence and Bioinformatics and Computational Biology 2007
DOI: 10.1109/cibcb.2007.4221210
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Microarrays with Predictive Weighted Ensembles

Abstract: Cluster ensembles seek a consensus across many individual partitions and the resulting solution is usually stable. Cluster ensembles are well suited to the analysis of DNA microarrays, where the tremendous size of the dataset can thwart the discovery of stable groups.Post processing cluster ensembles, where each individual partition is weighted according to its relative accuracy improves the performance of the ensemble whilst maintaining its stability. However, weighted cluster ensembles remain relatively unex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2007
2007
2019
2019

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 25 publications
0
5
0
Order By: Relevance
“…To make the procedure more amenable for use with large datasets, a partition of the datasets could be considered in the manner of Smyth and Coomans [27]. For example, the first subset would be used to train the individual trees, the second subset would be used to calculate the lasso weights, and the third subset could be used as a test subset.…”
Section: Discussionmentioning
confidence: 99%
“…To make the procedure more amenable for use with large datasets, a partition of the datasets could be considered in the manner of Smyth and Coomans [27]. For example, the first subset would be used to train the individual trees, the second subset would be used to calculate the lasso weights, and the third subset could be used as a test subset.…”
Section: Discussionmentioning
confidence: 99%
“…When p is too large, the response space dimension can be reduced via either principal component analysis or factor analysis. In Smyth and Coomans (2007) The similarity function is defined as…”
Section: Weighting Clusteringsmentioning
confidence: 99%
“…There are a number of different consensus clustering approaches which can be divided into two categories based on their emphases [17], [18], [19], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33]. The approaches [18], [19], [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30] in the first category designed new consensus clustering techniques, and applied them to cancer gene expression profiles. For example, Duboit et al [18], [19] applied a prediction-based resampling consensus clustering approach named Clest to perform class discovery from four cancer microarray data sets.…”
Section: Related Workmentioning
confidence: 99%
“…They [23] also designed a new cluster ensemble approach based on the perturbation technique and the neural gas algorithm to perform clustering analysis on cancer data sets, and proposed a new cluster validity index to identify the number of cancer subtypes. Smyth and Coomans [24] presented a weighted consensus clustering approach by considering the relative accuracy of each clustering solution, and applied it to perform cluster analysis on cancer gene expression profiles. Valentini and Bertoni [25], [26], [27] introduced the randomized map-based consensus clustering approach, and applied it to discover the meaningful clusters from the gene expression profiles.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation