2019
DOI: 10.1186/s13059-019-1659-6
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of error profiles in deep next-generation sequencing data

Abstract: BackgroundSequencing errors are key confounding factors for detecting low-frequency genetic variants that are important for cancer molecular diagnosis, treatment, and surveillance using deep next-generation sequencing (NGS). However, there is a lack of comprehensive understanding of errors introduced at various steps of a conventional NGS workflow, such as sample handling, library preparation, PCR enrichment, and sequencing. In this study, we use current NGS technology to systematically investigate these quest… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

7
238
2

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 248 publications
(247 citation statements)
references
References 38 publications
7
238
2
Order By: Relevance
“…In seeking alternative, non-biological explanations for this observation, they cannot have arisen through misincorporation errors in the next generation sequence methods used to produce the dataset because the analysis in the current was restricted to consensus sequences. These are generally assembled from libraries that typically possess reasonable coverage and read depth; error frequencies of < 10 -4 per site (14) would therefore improbably create 170 a consensus change in a sequence library. There was furthermore, no comparable increase in G->A mutations ( Fig.…”
Section: Discussion 155mentioning
confidence: 99%
“…In seeking alternative, non-biological explanations for this observation, they cannot have arisen through misincorporation errors in the next generation sequence methods used to produce the dataset because the analysis in the current was restricted to consensus sequences. These are generally assembled from libraries that typically possess reasonable coverage and read depth; error frequencies of < 10 -4 per site (14) would therefore improbably create 170 a consensus change in a sequence library. There was furthermore, no comparable increase in G->A mutations ( Fig.…”
Section: Discussion 155mentioning
confidence: 99%
“…Therefore, Substitution error rates are more significant than deletion and insertion. The substitution error rate by conventional NGS was reported to be >1% in 2011 and was found to be similar in later reports [5]. Some groups tried to systematically investigate substitution error profile by analysing multiple datasets [10].…”
Section: Introductionmentioning
confidence: 74%
“…Unfortunately, these sequences are not error-free. Different sequencing technologies are characterized by different types and frequency of errors (Ma et al, 2019). Often these sequencing errors are not random and are typical for certain sets of nucleotides such as homopolymers.…”
Section: Introductionmentioning
confidence: 99%