2019
DOI: 10.1186/s13059-018-1605-z
|View full text |Cite
|
Sign up to set email alerts
|

A comparative evaluation of hybrid error correction methods for error-prone long reads

Abstract: BackgroundThird-generation sequencing technologies have advanced the progress of the biological research by generating reads that are substantially longer than second-generation sequencing technologies. However, their notorious high error rate impedes straightforward data analysis and limits their application. A handful of error correction methods for these error-prone long reads have been developed to date. The output data quality is very important for downstream analysis, whereas computing resources could li… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
95
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 103 publications
(95 citation statements)
references
References 56 publications
0
95
0
Order By: Relevance
“…Long read technologies could in theory enable 100% accurate isoform quan-tification, if issues due to a high base calling error rate could be overcome [23]. However we find that even when no isoform detection errors occur, our ability to accurately detect isoforms is very limited.…”
Section: Discussionmentioning
confidence: 87%
“…Long read technologies could in theory enable 100% accurate isoform quan-tification, if issues due to a high base calling error rate could be overcome [23]. However we find that even when no isoform detection errors occur, our ability to accurately detect isoforms is very limited.…”
Section: Discussionmentioning
confidence: 87%
“…Long-read sequencing technologies, particularly with the introduction of the Sequel I and II platforms (Pacific Biosciences), now have high enough volume that new assemblies are being constructed purely from long reads. Although long reads currently have high indel-derived error rates, those errors can be corrected with sufficient sequencing depth (Fu et al 2019). These assemblies have huge N50 measures, and typically contain very few or even no gaps because very limited or no scaffolding is employed, although repeats occasionally result in mis-joins of contigs.…”
Section: Resultsmentioning
confidence: 99%
“…Second, the random error rate of TGS was reported to be more than 10%, which is higher than that in NGS. [44] This may affect the results of downstream analyses, including the accuracy of genetic variation identification. To resolve this problem, the accuracy could be improved to more than 99% by increasing the sequencing depth by using more SMRT cells.…”
Section: Unsolved Problemsmentioning
confidence: 99%