A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium

Su, Zhenqiang; Łabaj, Paweł P.; Li, Sheng; Thierry‐Mieg, Jean; Thierry-Mieg, Danielle; Shi, Wei; Wang, Charles; Schroth, Gary P.; Setterquist, Robert A.; Thompson, John F.; Jones, Wendell; Xiao, Wenzhong; Xu, Weihong; Jensen, Roderick V.; Kelly, Reagan; Xu, Joshua; Conesa, Ana; Furlanello, Cesare; Gao, Hanlin; Hong, Huixiao; Jafari, Nadereh; Letovsky, Stan; Liao, Yang; Lü, Fei; Oakeley, Edward J.; Peng, Zhiyu; Praul, Craig A.; Santoyo-López, Javier; Scherer, Andreas; Shi, Tieliu; Smyth, Gordon K.; Staedtler, Frank; Sykacek, Peter; Tan, Xiao; Thompson, E. Aubrey; Vandesompele, Jo; Wang, May D.; Wang, Jian; Wolfinger, Russell D.; Zavadil, Jiří; Auerbach, Scott S.; Bao, Wenjun; Binder, Hans; Blomquist, Thomas M.; Brilliant, Murray H.; Bushel, Pierre R.; Cai, Weimin; Catalano, Jennifer; Chang, Ching Wei; Chen, Tao; Chen, Geng; Chen, Rong; Chierici, Marco; Chu, Tzu Ming; Clevert, Djork-Arné; Deng, Youping; Derti, Adnan; Devanarayan, Viswanath; Dong, Zirui; Dopazo, Joaquín; Du, Tingting; Fang, Hong; Fang, Yongxiang; Fasold, Mario; Fernandez, Anita; Fischer, Matthias; Furió-Tarí, P.; Fuscoe, James C.; Caimet, Florian; Gaj, Stan; Gandara, Jorge; Gao, Huan; Ge, Weigong; Gondo, Yoichi; Gong, Binsheng; Gong, Meihua; Gong, Zhuolin; Green, Bridgett; Guo, Chao; Guo, Lei; Guo, Li; Hadfield, James; Hellemans, Jan; Hochreiter, Sepp; Jia, Meiwen; Jian, Min; Johnson, Charles D.; Kay, Suzanne; Kleinjans, Jos; Lababidi, Samir; Levy, Shawn; Li, Quan Zhen; Li, Li; Li, Peng; Li, Yan; Li, Haiqing; Li, Jianying; Li, Shiyong; Lin, Simon; López, Francisco J.; Lü, Xin; Luo, Heng; Ma, Xiwen; Meehan, Joe; Megherbi, Dalila B.; Mei, Nan; Mu, Bing; Ning, Baitang; Pandey, Akhilesh; Pérez-Florido, Javier; Perkins, Roger; Peters, Ryan M.; Phan, John H.; Pirooznia, Mehdi; Qian, Feng; Qing, Tao; Rainbow, Lucille; Rocca-Serra, Philippe; Sambourg, Laure; Sansone, Susanna-Assunta; Schwartz, Scott; Shah, Ruchir; Shen, Jie; Smith, T. M. F.; Stegle, Oliver; Stralis‐Pavese, Nancy; Stupka, Elia; Suzuki, Yutaka; Szkotnicki, Lee T.; Tinning, Matthew; Tu, Bimeng; Delft, Joost van; Vela-Boza, Alicia; Venturini, Elisa; Walker, Stephen J.; Wan, Liqing; Wang, Wei; Wang, Jinhui; Wang, Jun; Wieben, Eric D.; Willey, James C.; Wu, Po Yen; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Yin, Ye Kui; Yu, Ying; Yuan, Yate Ching; Zhang, Junyi; Zhang, Ke K.; Zhang, Wenqian; Zhang, Wenwei; Zhang, Yanyan; Zhao, Chen; Zheng, Yuanting; Zhou, Yiming; Zumbo, Paul; Tong, Weida; Kreil, David P.; Mason, Christopher E.; Shi, Leming

doi:10.1038/nbt.2957

Cited by 865 publications

(573 citation statements)

References 50 publications

Supporting

Mentioning

525

Contrasting


“…However, the sensitivity and accuracy of RNA‐Seq largely depends on the millions of reads sequenced per sample, the number of replicates used and the filtering and mapping procedures for data processing. For example, in a study comparing RNA‐Seq sequencing depth and the identification of differentially expressed genes, 10 million mapped fragments were sufficient to confirm the differential expression of the most strongly expressed genes but genes with lower expression levels suffered a high FDR (SEQC/MAQC‐III Consortium 2014). Therefore, although the SSH method can yield false positives, the identification of differentially expressed genes with low expression levels by RNA‐Seq requires a greater sequencing depth, and this is more expensive.…”

Section: Discussion (mentioning)

confidence: 99%

Seasonal phenotype‐specific transcriptional reprogramming during metamorphosis in the European map butterfly Araschnia levana

Vilcinskas

Vogel

2016

Ecology and Evolution


The European map butterfly (Araschnia levana) is a classic example of seasonal polyphenism because the spring and summer imagoes display two distinct morphological phenotypes. The light regime and temperature during larval and prepupal development determine whether or not the pupae commit to diapause and overwintering and thus whether spring or summer imagoes emerge. We used suppression subtractive hybridization to experimentally screen for genes that are differentially expressed in prepupae committed either to accelerated metamorphosis and egg production or diapause and overwintering. The range and ontology of the differentially expressed genes in prepupae developing from larvae exposed either to long‐day (LD) or short‐day (SD) conditions revealed fundamental differences. The SD prepupae preferentially expressed genes related to cuticle formation and immunity, reflecting the formation of a robust pupal exoskeleton and the upregulation of antimicrobial peptides as preparations for overwintering. One protein preferentially expressed in SD prepupae has a counterpart in Bombyx mori that functions as a diapause duration clock. The differentially expressed genes in LD prepupae included several members of the dusky and osiris families. We also observed the strong induction of different yellow‐like genes under SD and LD conditions which suggest a role in the developmental choice between seasonal phenotypes. Our transcriptomic data will facilitate the more detailed analysis of molecular mechanisms underlying seasonal polyphenism.


“…2d). While technical variation in RNA-seq is known to depend on GC content [8,9], variancePartition gives a clear illustration of how the effect of technical artifacts varies substantially across genes. Moreover, this analysis can be used to identify other correlates underlying technical issues in expression variation.…”

Section: Analysis Of Geuvadis RNA-seq Dataset (mentioning)

confidence: 99%

“…What is the relative contribution of experimental stimulus versus regulatory genetics to variation in gene expression [5]? Is technical variability of RNA-seq low enough to study regulatory genetics and disease biology, and what are the major drivers of this technical variability [2,8,9]? A rich understanding of complex datasets requires answering these questions with both a genomewide summary and gene-level resolution.…”

Section: Introduction (mentioning)

confidence: 99%

variancePartition: Interpreting drivers of variation in complex gene expression studies

Hoffman

Schadt

2016

Preprint

118


Background: As large-scale studies of gene expression with multiple sources of biological and technical variation become widely adopted, characterizing these drivers of variation becomes essential to understanding disease biology and regulatory genetics. Results: We describe a statistical and visualization framework, variancePartition, to prioritize drivers of variation based on a genome-wide summary, and identify genes that deviate from the genome-wide trend. Using a linear mixed model, variancePartition quantifies variation in each expression trait attributable to differences in disease status, sex, cell or tissue type, ancestry, genetic background, experimental stimulus, or technical variables. Analysis of four large-scale transcriptome profiling datasets illustrates that variancePartition recovers striking patterns of biological and technical variation that are reproducible across multiple datasets. Conclusions: Our open source software, variancePartition, enables rapid interpretation of complex gene expression studies as well as other high-throughput genomics assays. variancePartition is available from Bioconductor: http://bioconductor.org/packages/variancePartition.


“…Quantifying sample-to-sample differences (i.e., fold-changes) for each component often provides sufficient information for generating or testing hypotheses, eliminating the need for experimentally more demanding absolute quantification. Such relative quantitation has thus become the standard approach in many analytical disciplines, especially in the analysis of structurally complex biomolecules in highly multiplexed fashion [1–4]. Although many ingenious quantitation methods have been developed in this context, implementing the more powerful ones (e.g., metabolic labeling) is far from routine, as many of these are difficult and/or expensive to perform [5].…”

Section: Introduction (mentioning)

confidence: 99%

RElative QUantitation Inferred by Evaluating Mixtures (REQUIEM)

Tuomivaara

Schliekelman

Nairn

et al. 2017

Analytica Chimica Acta


Motivated by the lack of easily implementable and generally applicable strategies to increase and assess data accuracy, we devised a novel label-free approach, termed REQUIEM, to address challenges in relative quantitation. For comparing the relative amounts of analytes in two samples, a mixture is prepared from aliquots of the samples, and the samples and the mixture are analyzed in parallel according to the intended workflow. Processing of the resulting data using the REQUIEM algorithm yields unbiased analyte fold-changes and associated statistics, allowing several types of errors to be diagnosed or eliminated. Extensive simulations and analysis of carefully prepared standard samples demonstrated the rigorous foundations of REQUIEM. We applied REQUIEM to several real-world analytical techniques and workflows, notably to tandem mass spectrometry analysis by using isomeric oligosaccharides as test analytes. We conclude that REQUIEM can reveal inaccuracies in the data that are difficult to identify by using traditional approaches.
