2010
DOI: 10.1093/bioinformatics/btq203
|View full text |Cite
|
Sign up to set email alerts
|

Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph

Abstract: Motivation: The results of initial analyses for many high-throughput technologies commonly take the form of gene or protein sets, and one of the ensuing tasks is to evaluate the functional coherence of these sets. The study of gene set function most commonly makes use of controlled vocabulary in the form of ontology annotations. For a given gene set, the statistical significance of observing these annotations or ‘enrichment’ may be tested using a number of methods. Instead of testing for significance of indivi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
31
0

Year Published

2011
2011
2016
2016

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 23 publications
(31 citation statements)
references
References 39 publications
(57 reference statements)
0
31
0
Order By: Relevance
“…Gene Ontology terms, for example, are actually organized in a tree-like hierarchy. This would be an interesting feature to explore with respect to modeling, visualization and enrichment algorithms 24 . The relationships rendered as edges in EGAN (entity-entity relationships and entity-set associations) are discrete: a relationship exists or it does not.…”
Section: Discussionmentioning
confidence: 99%
“…Gene Ontology terms, for example, are actually organized in a tree-like hierarchy. This would be an interesting feature to explore with respect to modeling, visualization and enrichment algorithms 24 . The relationships rendered as edges in EGAN (entity-entity relationships and entity-set associations) are discrete: a relationship exists or it does not.…”
Section: Discussionmentioning
confidence: 99%
“…where i and j indicate two GO terms, d G is the set of edge weights for all the edges in the GOGraph G , d P5 is the value of the weight of the 5th percentile, and | g i, j | is the number of genes that are shared between the GO terms i and j [26]. After augmenting the graph, we then found a Steiner tree and used its total weight to reflect the total information loss.…”
Section: Methodsmentioning
confidence: 99%
“…We use the Nadaraya-Watson non-parametric regression [26,28] to capture the non-linear relationship between the size n and the parameters of random gene sets of the same size, using the following equations:…”
Section: Methodsmentioning
confidence: 99%
“…An important application of the GO is to perform the enrichment analysis that identifies which GO categories are over-represented on the gene sets of interest. Along this line, the use of GO graphs for coding the relationships among annotations was shown to further improve the enrichment analysis [19].…”
Section: Introductionmentioning
confidence: 99%