2002
DOI: 10.1093/bioinformatics/18.11.1462
|View full text |Cite
|
Sign up to set email alerts
|

Methods for assessing reproducibility of clustering patterns observed in analyses of microarray data

Abstract: We present statistical methods for testing for overall clustering of gene expression profiles, and we define easily interpretable measures of cluster-specific reproducibility that facilitate understanding of the clustering structure. We apply these methods to elucidate structure in cDNA microarray gene expression profiles obtained on melanoma tumors and on prostate specimens.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

3
156
0

Year Published

2003
2003
2017
2017

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 194 publications
(159 citation statements)
references
References 19 publications
3
156
0
Order By: Relevance
“…On a randomly selected subset of the tumors, a second independent reading was obtained and the agreement between the two readings was assessed using Samples and expression of progenitor cell markers were clustered using hierarchical clustering using Euclidian distance and complete linkage and presented using a heatmap. 17,18 To assess the clustering stability, an R index was calculated by perturbing the data through the introduction of random noise and then re-clustering the perturbed data; this process is repeated multiple times, and the results are compared with the original cluster of unperturbed data. The R index is the proportion of times that a patient pair clusters the same way in the perturbed data sets as in the original data set.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…On a randomly selected subset of the tumors, a second independent reading was obtained and the agreement between the two readings was assessed using Samples and expression of progenitor cell markers were clustered using hierarchical clustering using Euclidian distance and complete linkage and presented using a heatmap. 17,18 To assess the clustering stability, an R index was calculated by perturbing the data through the introduction of random noise and then re-clustering the perturbed data; this process is repeated multiple times, and the results are compared with the original cluster of unperturbed data. The R index is the proportion of times that a patient pair clusters the same way in the perturbed data sets as in the original data set.…”
Section: Discussionmentioning
confidence: 99%
“…In this study, the perturbed data sets were calculated by introducing a small probability (5%) of flipping the binary marker values rather than by the introduction of Gaussian noise. 18 …”
Section: Discussionmentioning
confidence: 99%
“…Nevertheless, the stability and reliability of the obtained clusters is crucial to assess the confidence and the significance of a bio-medical discovery [33,34].…”
Section: Introductionmentioning
confidence: 99%
“…Some recent approaches to estimate the reliability of the discovered clusters are based on the concept of the stability with respect to perturbations [33][34][35]. In the context of gene expression data, that are usually characterized by relatively high level of noise [36], stability can be considered an important property: how much the characteristics and composition of the discovered clusters hold when perturbation such as added noise, resampling or random projections are introduced?…”
Section: Introductionmentioning
confidence: 99%
“…The molecular classification in this study did not include the current clinical parameters like tumour grade, steroid receptor status, and HER-2/ neu. In essence, there may be more clusters and molecular subtypes of breast cancer that may be apparent if larger sample sets are available (McShane et al, 2002). Such formal statistical testing has not yet been carried out on the current molecular classification.…”
Section: Expression Microarray Analyses For the Identification Of Promentioning
confidence: 99%