2009
DOI: 10.1186/1471-2105-10-45
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of small n statistical tests of differential expression applied to microarrays

Abstract: Background: DNA microarrays provide data for genome wide patterns of expression between observation classes. Microarray studies often have small samples sizes, however, due to cost constraints or specimen availability. This can lead to poor random error estimates and inaccurate statistical tests of differential expression. We compare the performance of the standard t-test, fold change, and four small n statistical test methods designed to circumvent these problems. We report results of various normalization me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
57
0
1

Year Published

2010
2010
2019
2019

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 72 publications
(58 citation statements)
references
References 43 publications
0
57
0
1
Order By: Relevance
“…Some of the microarray studies we looked at have small sample sizes, which gives rise to the possibility of poor random error estimates and inaccurate statistical tests for differential expression. For this reason, we selected limma t -statistics, an empirical Bayes method [52], which is reportedly one of the most effective methods for differential expression analysis even for very small data sets [53]. To find the combined significance of the pathways across multiple diseases, we used Fisher’s combined probability test [39], because, it gives a single test of significance for a number of not-so-correlated tests of significance performed on very heterogeneous data sets.…”
Section: Discussionmentioning
confidence: 99%
“…Some of the microarray studies we looked at have small sample sizes, which gives rise to the possibility of poor random error estimates and inaccurate statistical tests for differential expression. For this reason, we selected limma t -statistics, an empirical Bayes method [52], which is reportedly one of the most effective methods for differential expression analysis even for very small data sets [53]. To find the combined significance of the pathways across multiple diseases, we used Fisher’s combined probability test [39], because, it gives a single test of significance for a number of not-so-correlated tests of significance performed on very heterogeneous data sets.…”
Section: Discussionmentioning
confidence: 99%
“…Following normalization, average signal intensity of the probes showing at least 30% change in expression across the 3 donors were computed and ratios (treated/untreated) were log 2 transformed. Statistical analysis of the data was performed using Cyber-T regularized t statistic [14] due to small sample size (n = 3) since it takes into account Bayesian estimate of variance by pooling across genes with similar intensities [15]. The functional annotation tool (DAVID Bioinformatics Resources 6.7) was used to determine the biological relevance of the data and molecular functions represented by differentially regulated genes [16], enabling us to explore and clarify the biological process, by considering a p-value ≤0.05 as significant.…”
Section: Microarray Expression Data Analysismentioning
confidence: 99%
“…More empirical alternatives include the use of re-sampling methods (to compare genes from small subsets of samples and those from the full dataset) [3], [19], and the use of spike-in data for which a set of genes are differentially expressed by design [12], [20]. Finally Jeffery et al [18] explore an indirect approach by assessing classification performance obtained with genes resulting from the application of the methods to compare.…”
Section: Introductionmentioning
confidence: 99%