2014
DOI: 10.1186/1471-2105-15-91
|View full text |Cite
|
Sign up to set email alerts
|

Differential meta-analysis of RNA-seq data from multiple studies

Abstract: BackgroundHigh-throughput sequencing is now regularly used for studies of the transcriptome (RNA-seq), particularly for comparisons among experimental conditions. For the time being, a limited number of biological replicates are typically considered in such experiments, leading to low detection power for differential expression. As their cost continues to decrease, it is likely that additional follow-up studies will be conducted to re-address the same biological question.ResultsWe demonstrate how p-value combi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
130
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 136 publications
(131 citation statements)
references
References 24 publications
1
130
0
Order By: Relevance
“…For RS, the only group with more than one dataset, we performed a meta-analysis combining three different methods: we used (1) a negative binomial-generalized linear model (NB-GLM), pooling all the samples in all the datasets for one particular senescence-inducing stimulus and comparing them to proliferating counterparts, and within the model, we included a covariate that accounted for inter-laboratory and inter-strain differences (see STAR Methods for details); (2) an analysis of each individual dataset and subsequent combination of the p values using the Fisher method; and (3) an inverse-normal p value combination technique in which each dataset was weighted according to the number of replicates [17, 18]. We set a stringent threshold of nominal p ≤ 0.01 to reduce the odds of false-positive results and retained only those genes that were differentially expressed by the three different methods.…”
Section: Resultsmentioning
confidence: 99%
“…For RS, the only group with more than one dataset, we performed a meta-analysis combining three different methods: we used (1) a negative binomial-generalized linear model (NB-GLM), pooling all the samples in all the datasets for one particular senescence-inducing stimulus and comparing them to proliferating counterparts, and within the model, we included a covariate that accounted for inter-laboratory and inter-strain differences (see STAR Methods for details); (2) an analysis of each individual dataset and subsequent combination of the p values using the Fisher method; and (3) an inverse-normal p value combination technique in which each dataset was weighted according to the number of replicates [17, 18]. We set a stringent threshold of nominal p ≤ 0.01 to reduce the odds of false-positive results and retained only those genes that were differentially expressed by the three different methods.…”
Section: Resultsmentioning
confidence: 99%
“…For this reason, random-effects models are generally a more desirable method to use in gene-expression meta-analysis, where the real goal is to try to discover the background biological ‘true’ effect, rather than simply to synthesize the available data. One drawback of random-effects meta-analysis is that it is not appropriate for count-based data such as RNA-seq (as the count data are not normally distributed), as discussed elsewhere (27,28). However, microarrays are still the dominant genome-wide expression measurement assay: in 2015, 6569 new RNA assays were indexed by ArrayExpress and GEO, of which 2024 were sequencing assays and 4615 were array assays (presumably a very small number of studies had both).…”
Section: Introductionmentioning
confidence: 99%
“…Because secondary structures and nucleic acid binding proteins cause reproducible bisulfite deamination artifacts (Warnecke et al 2002), we could not use raw P-values for subsequent tests, as this would lead to significance for most of these artifacts. However, Fisher's method has shown robustness in a variety of contexts where the underlying assumptions are not strictly met (Derkach et al 2013;Rau et al 2014). We therefore summed the log-transformed www.genome.org…”
Section: Methylation Callingmentioning
confidence: 99%