2010
DOI: 10.1371/journal.pone.0012336
|View full text |Cite
|
Sign up to set email alerts
|

Should We Abandon the t-Test in the Analysis of Gene Expression Microarray Data: A Comparison of Variance Modeling Strategies

Abstract: High-throughput post-genomic studies are now routinely and promisingly investigated in biological and biomedical research. The main statistical approach to select genes differentially expressed between two groups is to apply a t-test, which is subject of criticism in the literature. Numerous alternatives have been developed based on different and innovative variance modeling strategies. However, a critical issue is that selecting a different test usually leads to a different gene list. In this context and give… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
112
0
2

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 139 publications
(114 citation statements)
references
References 31 publications
0
112
0
2
Order By: Relevance
“…Significantly regulated metabolites were evaluated by pairwise comparison of control to low or high concentration for each time point using LIMMA, which has a higher discriminatory power than t tests without increasing the false positive rate, especially when dealing with small sample sizes (39). The returned p values were corrected for multiple testing according to Hochberg and Benjamini (35).…”
Section: Resultsmentioning
confidence: 99%
“…Significantly regulated metabolites were evaluated by pairwise comparison of control to low or high concentration for each time point using LIMMA, which has a higher discriminatory power than t tests without increasing the false positive rate, especially when dealing with small sample sizes (39). The returned p values were corrected for multiple testing according to Hochberg and Benjamini (35).…”
Section: Resultsmentioning
confidence: 99%
“…Besides data pre-processing, we used a combination of three different algorithms: SAM (Tusher et al, 2001), LIMMA (Wettenhall and Smyth, 2004) and SMVar (Jaffrezic et al, 2007). These algorithms were chosen because they are representative of different variance modelling strategies in gene expression data (Jeanmougin et al, 2010). In total, 786 genes were identified as being significantly up regulated in S. cerevisiae, 327 genes in C. glabrata and 337 genes in C. albicans ( Figure 1, step 1).…”
Section: Step 1: Genome-wide Expression Data To Measure the Transcripmentioning
confidence: 99%
“…Indeed, these methods assume that (i) only a small fraction of genes are significantly deregulated, and (ii) this deregulation occurs symmetrically around zero (i.e., there is an equal number of up-and down-regulated genes) . While these assumptions may hold true for transcriptional profiling (for which all these methods were initially developed), they can be easily violated with skewed experiments (due to the study design and/or to the nature of the biological samples) such as enrichment experiments (e.g., CHromatin Immuno Precipitation) or sensitive systems where deregulation is global and/or asymmetrical, such as in many studies of the regulation of translation (changes in the "translatome") (Jeanmougin et al 2010;Landfors et al 2011). When these assumptions are violated, general microarray methods will introduce inferential errors leading to spurious deregulation (false positives) and/or censored biological changes (false negatives) (Zhao and Pan 2003;Dabney and Storey 2007).…”
Section: Introductionmentioning
confidence: 99%