2014
DOI: 10.1186/gb-2014-15-6-r86
|View full text |Cite
|
Sign up to set email alerts
|

IVT-seq reveals extreme bias in RNA sequencing

Abstract: BackgroundRNA-seq is a powerful technique for identifying and quantifying transcription and splicing events, both known and novel. However, given its recent development and the proliferation of library construction methods, understanding the bias it introduces is incomplete but critical to realizing its value.ResultsWe present a method, in vitro transcription sequencing (IVT-seq), for identifying and assessing the technical biases in RNA-seq library generation and sequencing at scale. We created a pool of over… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

1
102
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 145 publications
(103 citation statements)
references
References 26 publications
1
102
0
Order By: Relevance
“…Recent large-scale comparisons have been critical of the accuracy of much of the transcript reconstruction process. 38,39 It may be that RNAseq deconvolution algorithms have a preferential bias for shorter transcripts. The set of approximately 3000 genes with reliably identified dominant isoforms that we have generated in this experiment would make an ideal gold standard set for validating RNAseq reconstruction algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…Recent large-scale comparisons have been critical of the accuracy of much of the transcript reconstruction process. 38,39 It may be that RNAseq deconvolution algorithms have a preferential bias for shorter transcripts. The set of approximately 3000 genes with reliably identified dominant isoforms that we have generated in this experiment would make an ideal gold standard set for validating RNAseq reconstruction algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…At the transcript level, sample-specific biases that current methods correct for include the fragment length distribution induced by size selection, positional bias along the transcript due to RNA degradation and mRNA selection techniques, and sequence-based bias in read start positions arising from the differential binding efficiency of random hexamer primers 2,12-16 (Figure 1a and Supplementary Table 1). Even so, it is common to observe extreme variability in the coverage of RNA-seq fragments along transcripts that is purely technical and sample-specific 17 and not explained by current bias models, which confounds current methods designed to identify and quantify transcripts 18 (Figure 1b). …”
mentioning
confidence: 99%
“…We used a benchmarking dataset of 1,062 human in vitro transcribed (IVT) and sequenced cDNA clones mixed at various concentrations with mouse total RNA 17 . We focused our analysis on 64 of the IVT transcripts as exhibiting “high unpredictable coverage” 17 .…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…The RNA was extracted by using the hot-phenol method [7] and DNase I treated to remove contaminating DNA. The RNA samples were not depleted for rRNA prior to sequencing, which tends to eliminate some experimental biases [8]. The RNA samples were shipped on dry ice to vertis Biotechnologie AG (Germany) for library preparation and Illumina HiSeq2000 sequencing, as described by others [7,9].…”
Section: Single-nucleotide Resolved Rna-seq Datasetmentioning
confidence: 99%