2014
DOI: 10.1038/nbt.2957
|View full text |Cite
|
Sign up to set email alerts
|

A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium

Abstract: We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the United States Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we assess RNA sequencing (RNA-seq) performance for junction discovery and differential expression profiling and compare it to microarray and quantitative PCR (qPCR) data using complementary metrics. At all sequencing d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

18
525
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 865 publications
(573 citation statements)
references
References 50 publications
18
525
0
Order By: Relevance
“…However, the sensitivity and accuracy of RNA‐Seq largely depends on the millions of reads sequenced per sample, the number of replicates used and the filtering and mapping procedures for data processing. For example, in a study comparing RNA‐Seq sequencing depth and the identification of differentially expressed genes, 10 million mapped fragments were sufficient to confirm the differential expression of the most strongly expressed genes but genes with lower expression levels suffered a high FDR (SEQC/MAQC‐III Consortium 2014). Therefore, although the SSH method can yield false positives, the identification of differentially expressed genes with low expression levels by RNA‐Seq requires a greater sequencing depth, and this is more expensive.…”
Section: Discussionmentioning
confidence: 99%
“…However, the sensitivity and accuracy of RNA‐Seq largely depends on the millions of reads sequenced per sample, the number of replicates used and the filtering and mapping procedures for data processing. For example, in a study comparing RNA‐Seq sequencing depth and the identification of differentially expressed genes, 10 million mapped fragments were sufficient to confirm the differential expression of the most strongly expressed genes but genes with lower expression levels suffered a high FDR (SEQC/MAQC‐III Consortium 2014). Therefore, although the SSH method can yield false positives, the identification of differentially expressed genes with low expression levels by RNA‐Seq requires a greater sequencing depth, and this is more expensive.…”
Section: Discussionmentioning
confidence: 99%
“…2d). While technical variation in RNA-seq is known to depend on GC content [8,9], variancePartition gives a clear illustration of how the effect of technical artifacts varies substantially across genes. Moreover, this analysis can be used to identify other correlates underlying technical issues in expression variation.…”
Section: Analysis Of Geuvadis Rna-seq Datasetmentioning
confidence: 99%
“…What is the relative contribution of experimental stimulus versus regulatory genetics to variation in gene expression [5]? Is technical variability of RNA-seq low enough to study regulatory genetics and disease biology, and what are the major drivers of this technical variability [2,8,9]? A rich understanding of complex datasets requires answering these questions with both a genomewide summary and gene-level resolution.…”
Section: Introductionmentioning
confidence: 99%
“…Quantifying sample-to-sample differences (i.e., fold-changes) for each component often provides sufficient information for generating or testing hypotheses, eliminating the need for experimentally more demanding absolute quantification. Such relative quantitation has thus become the standard approach in many analytical disciplines, especially in the analysis of structurally complex biomolecules in highly multiplexed fashion [14]. Although many ingenious quantitation methods have been developed in this context, implementing the more powerful ones (e.g., metabolic labeling) is far from routine, as many of these are difficult and/or expensive to perform [5].…”
Section: Introductionmentioning
confidence: 99%