2011
DOI: 10.1186/1471-2105-12-290
|View full text |Cite
|
Sign up to set email alerts
|

Bias detection and correction in RNA-Sequencing data

Abstract: BackgroundHigh throughput sequencing technology provides us unprecedented opportunities to study transcriptome dynamics. Compared to microarray-based gene expression profiling, RNA-Seq has many advantages, such as high resolution, low background, and ability to identify novel transcripts. Moreover, for genes with multiple isoforms, expression of each isoform may be estimated from RNA-Seq data. Despite these advantages, recent work revealed that base level read counts from RNA-Seq data may not be randomly distr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

2
127
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 149 publications
(129 citation statements)
references
References 34 publications
2
127
0
Order By: Relevance
“…Recently, there have been several investigations [13][14][15] into the biases that affect the accuracy with which RNA-Seq represents the absolute abundance of a given transcript as measured by high precision approaches such as Taqman RT-PCR [16]. It has been shown that these abundance measures are prone to biases correlated with the nucleotide composition [14,17] and length of the transcript [1,18].…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, there have been several investigations [13][14][15] into the biases that affect the accuracy with which RNA-Seq represents the absolute abundance of a given transcript as measured by high precision approaches such as Taqman RT-PCR [16]. It has been shown that these abundance measures are prone to biases correlated with the nucleotide composition [14,17] and length of the transcript [1,18].…”
mentioning
confidence: 99%
“…It has been shown that these abundance measures are prone to biases correlated with the nucleotide composition [14,17] and length of the transcript [1,18]. Several within and between sample correction and normalisation procedures have recently been developed to address these biases either as nucleotide composition effects [17] or various combinations of nucleotide, length or library preparation biases [14,15].…”
mentioning
confidence: 99%
“…Thus, it is likely to have a higher level of expression rather than shorter transcripts due to this technical problem and not due to a real activation or inactivation of the transcription (Zheng et al 2011;Oshlack and Wakefield 2009). …”
Section: Normalizationmentioning
confidence: 99%
“…This model assumes that the reads are distributed uniformly across the genome, and hence reads are sampled independently and uniformly from every possible nucleotide in the sample (Vardhanabhuti et al, 2013). However, due to a number of biases, such as the 5'-and 3'-end biases, priming or GC bias, and so on, the uniform assumption of read distribution is untenable, and moreover, these various biases cause the nonuniform read distribution to be illogical, which causes the non-uniformity read distribution (Hansen et al, 2010;Zheng et al, 2011). This leads to difficulties in estimating isoform express levels.…”
Section: Introductionmentioning
confidence: 99%