2004
DOI: 10.1093/bioinformatics/bth924
|View full text |Cite
|
Sign up to set email alerts
|

Statistical modeling of sequencing errors in SAGE libraries

Abstract: An implementation using R is distributed as an R package. An online version is available at http://tagcalling.mbgproject.org

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
91
0

Year Published

2006
2006
2020
2020

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 99 publications
(91 citation statements)
references
References 13 publications
0
91
0
Order By: Relevance
“…Also, sequence neighborhood-based correction methods developed for SAGE libraries (Akmaev and Wang 2004;Beissbarth et al 2004) may be adapted to correct sequence bias in next-generation sequencing data. However, although these algorithms may succeed somewhat in correcting sequencing bias, they are also susceptible to over-correction of low abundant miRNA sequences that are similar to high abundant reads with reduced sensitivity as a consequence.…”
Section: Sequencing Data ''Sequence Variation''mentioning
confidence: 99%
“…Also, sequence neighborhood-based correction methods developed for SAGE libraries (Akmaev and Wang 2004;Beissbarth et al 2004) may be adapted to correct sequence bias in next-generation sequencing data. However, although these algorithms may succeed somewhat in correcting sequencing bias, they are also susceptible to over-correction of low abundant miRNA sequences that are similar to high abundant reads with reduced sensitivity as a consequence.…”
Section: Sequencing Data ''Sequence Variation''mentioning
confidence: 99%
“…Sequencing errors can still arise, and estimates of such errors can be generated. Statistical algorithms have been created to correct for these errors [57], but no all-inclusive solution to these problems have yet been reported for SAGE expression data. An ideal method should be able to indicate the degree of normalisation of a normalised library or for cancer staging.…”
Section: Techniques For Measuring Gene Expression Levels and Their LImentioning
confidence: 99%
“…The model we propose is similar to that of RECOUNT [Wijaya et al, 2009] used to correct next generation short read counts. Both models derive from a method originally meant to detect sequencing errors in SAGE libraries [Beissbarth et al, 2004]. Our model differs from the previous models in that it works with kmers rather than full reads, since there is insufficient replication of full length reads in genomic data (as compared to transcriptome data).…”
Section: Preliminariesmentioning
confidence: 99%
“…The update equations are adapted from Beissbarth et al [2004] using a different error model and are given as follows:…”
Section: Error Modelmentioning
confidence: 99%