2014
DOI: 10.1093/bib/bbu029
|View full text |Cite
|
Sign up to set email alerts
|

Correcting Illumina data

Abstract: Next-generation sequencing technologies revolutionized the ways in which genetic information is obtained and have opened the door for many essential applications in biomedical sciences. Hundreds of gigabytes of data are being produced, and all applications are affected by the errors in the data. Many programs have been designed to correct these errors, most of them targeting the data produced by the dominant technology of Illumina. We present a thorough comparison of these programs. Both HiSeq and MiSeq types … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
56
0
2

Year Published

2016
2016
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 37 publications
(59 citation statements)
references
References 32 publications
1
56
0
2
Order By: Relevance
“…In contrast, the previous Sanger sequencing technology, used to decipher the human genome, required over a decade to deliver the final draft [9]. For analyzing NGS data, some algorithms and software were developed for genome assembly [54], read correction [91], and short read mapping [31].…”
Section: Dna Genes and Chromosomesmentioning
confidence: 99%
“…In contrast, the previous Sanger sequencing technology, used to decipher the human genome, required over a decade to deliver the final draft [9]. For analyzing NGS data, some algorithms and software were developed for genome assembly [54], read correction [91], and short read mapping [31].…”
Section: Dna Genes and Chromosomesmentioning
confidence: 99%
“…The tests were performed for three datasets described in Table 2. Two of this datasets were taken from the survey by Molnar and Ilie (2015).…”
Section: Real Data Evaluationmentioning
confidence: 99%
“…Therefore, the correction of errors in reads is currently an important and popular issue. The existing solutions are discussed and compared in the recent surveys: (Yang et al, 2013), (Molnar and Ilie, 2015), (Laehnemann et al, 2015). Yang et al classify the correction algorithms into three groups: (i) kspectrum-based, (ii) suffix-tree/array-based, and (iii) multiple-sequencealignment-based.…”
Section: Introductionmentioning
confidence: 99%
“…There are only three reviews [1,2,79] published, with the latest one targeting only Illumina data. They focus on benchmarking the correctors, while this article extracts key information from the work in this field, summing up all important points deemed as important.…”
Section: Approachmentioning
confidence: 99%
“…For instance, Coral [15] is listed in this review in the category of msb algorithms [1,2,79], but it also uses the kmer spectrum to determine the related reads. The same case arises with Premier [136] and Premier Turbo [137] which use k-mers to update the probabilities for the variants on a position.…”
Section: Probabilistic Models Based (Pmb)mentioning
confidence: 99%