2004
DOI: 10.1093/bioinformatics/bth362
|View full text |Cite
|
Sign up to set email alerts
|

The CRASSS plug-in for integrating annotation data with hierarchical clustering results

Abstract: We describe an algorithm for finding the most statistically significant non-overlapping subtrees of a hierarchical clustering of gene expression data with respect to a set of secondary data labels on genes. The method is implemented as a Java plug-in for a commercial gene expression analysis program (GeneSpring).

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
12
0

Year Published

2005
2005
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(12 citation statements)
references
References 12 publications
0
12
0
Order By: Relevance
“…To our knowledge, in the whole bioinformatics literature there have been only four recent studies in which it has been attempted to establish general methods to compare hierarchical and non-hierarchical classifications, all of them in the context of microarray data analysis. In two of these studies [ 15 , 16 ], the method is similar, and very much related to those used for the two simpler cases discussed above and exemplified in Figures 1A and 1B . Starting with a hierarchical classification of expression data, which may be obtained with any conventional method, such as UPGMA, the degree of enrichment for a particular class (i.e.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…To our knowledge, in the whole bioinformatics literature there have been only four recent studies in which it has been attempted to establish general methods to compare hierarchical and non-hierarchical classifications, all of them in the context of microarray data analysis. In two of these studies [ 15 , 16 ], the method is similar, and very much related to those used for the two simpler cases discussed above and exemplified in Figures 1A and 1B . Starting with a hierarchical classification of expression data, which may be obtained with any conventional method, such as UPGMA, the degree of enrichment for a particular class (i.e.…”
Section: Introductionmentioning
confidence: 99%
“…The process is repeated until all non-overlapping clusters with small p values are determined. Finally, Bonferroni's correction is used to take into account the effect of multiple tests either considering the number of classes tested [ 15 ] or the number of clusters tested [ 16 ]. A third study followed the same strategy, but only up to the calculation of the p values, without further refinement of the results [ 17 ].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Traditionally, the known annotations are used only as a second step, after data have been clustered according to their variation patterns. Only those clusters in which many genes (and proteins/metabolites) are annotated within the same category (for example, the same MapMan BIN [14] or Gene Ontology (GO) terms [15]), are then selected for further analysis [16-19]. For each pattern, its annotations and memberships to well-known metabolic pathways are generally assessed.…”
Section: Introductionmentioning
confidence: 99%
“…Only those clusters in which many genes are annotated with the same annotation (e.g. the same biological process), are then selected for further analysis (Buehler, 2004; Curtis, 2005; Doherty, 2006; Toronen, 2004; and others). Fang et al (2006) took the opposite approach, first mapping the genes involved in the expression dataset to the GO hierarchy, and then looking only at those GO terms for which the mapped genes show high expression similarity.…”
Section: Introductionmentioning
confidence: 99%