2012
DOI: 10.1371/journal.pone.0046128
|View full text |Cite
|
Sign up to set email alerts
|

Length Bias Correction in Gene Ontology Enrichment Analysis Using Logistic Regression

Abstract: When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called “length bias”, will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(32 citation statements)
references
References 25 publications
0
32
0
Order By: Relevance
“…We used the results from each genome-wide scan as input into tests of groups of genes of common function for enriched association signal, using GOglm [37]. Emerging from this analysis were eight GO terms whose genes tended to associate with different aspects of metabolism (Additional file 1: Table S3).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We used the results from each genome-wide scan as input into tests of groups of genes of common function for enriched association signal, using GOglm [37]. Emerging from this analysis were eight GO terms whose genes tended to associate with different aspects of metabolism (Additional file 1: Table S3).…”
Section: Resultsmentioning
confidence: 99%
“…Given the results from a single-trait or cross-phenotype association scan across markers as above, we first tabulated the single best-scoring marker for every gene tested. We then used the ranked list of these genes as input into GOglm, which tests for GO term enrichment in a ranked list without arbitrary thresholding into significant and insignificant genes, and corrects for gene length effects [37]. Only GO terms with >10 members with association results were considered for analysis.…”
Section: Methodsmentioning
confidence: 99%
“…A similar effect is present in the analysis of differential expression of RNASeq data: longer transcripts generate a greater number of reads and are more likely to be detected as differentially expressed compared with their short counterparts [179]. Several enrichment-based algorithms explicitly take into account this long-gene effect, including GOSeq [180], SeqGSEA [181], GSVA [174], and GOglm [182]. …”
Section: Functional Interpretation Of Datamentioning
confidence: 99%
“…Additionally, the top 2 sub-networks were visualized by Cytoscape. Furthermore, for the genes in top 10 sub-networks, GO enrichment analysis [22] was performed through BINGO (Biological Networks Gene Ontology tool) [23], a Cytoscape plugin used to assess the ontology categories of genes in biological networks. Then, the bio-functional regions with p value<0.05 were identified.…”
Section: Sub-network Exploring and Enrichment Analysismentioning
confidence: 99%