2019
DOI: 10.1038/s41467-019-09026-y
|View full text |Cite
|
Sign up to set email alerts
|

The use of technical replication for detection of low-level somatic mutations in next-generation sequencing

Abstract: Accurate genome-wide detection of somatic mutations with low variant allele frequency (VAF, <1%) has proven difficult, for which generalized, scalable methods are lacking. Herein, we describe a new computational method, called RePlow, that we developed to detect low-VAF somatic mutations based on simple, library-level replicates for next-generation sequencing on any platform. Through joint analysis of replicates, RePlow is able to remove prevailing background errors in next-generation sequencing analysis, faci… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
57
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
8

Relationship

4
4

Authors

Journals

citations
Cited by 54 publications
(60 citation statements)
references
References 37 publications
3
57
0
Order By: Relevance
“…We aligned raw sequences from Fastq files to the hg19/GRCh37 assembly of the human genome reference sequence using BWA-MEM ( http://bio-bwa.sourceforge.net ). After recommended preprocessing, indel realignment, and base recalibration, we utilized RePlow ( https://sourceforge.net/projects/replow/ ), a variant caller designed to detect low-level somatic variants from replicated sample data [ 13 ]. We then applied in-house filtering criteria: 1) excluding registered common variants in a public database (common dbSNP147; https://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg38&g=snp147Common ); 2) excluding variants with a putative low snpEFF impact score ( http://snpeff.sourceforge.net/SnpEff.html#intro ); 3) excluding variants with a PolyPhen & SIFT ≠ Damaging, phastCons score < 0.9; and 4) excluding variants with an allele frequency > 0.1% in the ExAC and gnomAD databases (both from https://gnomad.broadinstitute.org ) for minor allele frequencies of general and East Asian populations [ 14 ].…”
Section: Methodsmentioning
confidence: 99%
“…We aligned raw sequences from Fastq files to the hg19/GRCh37 assembly of the human genome reference sequence using BWA-MEM ( http://bio-bwa.sourceforge.net ). After recommended preprocessing, indel realignment, and base recalibration, we utilized RePlow ( https://sourceforge.net/projects/replow/ ), a variant caller designed to detect low-level somatic variants from replicated sample data [ 13 ]. We then applied in-house filtering criteria: 1) excluding registered common variants in a public database (common dbSNP147; https://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg38&g=snp147Common ); 2) excluding variants with a putative low snpEFF impact score ( http://snpeff.sourceforge.net/SnpEff.html#intro ); 3) excluding variants with a PolyPhen & SIFT ≠ Damaging, phastCons score < 0.9; and 4) excluding variants with an allele frequency > 0.1% in the ExAC and gnomAD databases (both from https://gnomad.broadinstitute.org ) for minor allele frequencies of general and East Asian populations [ 14 ].…”
Section: Methodsmentioning
confidence: 99%
“…The gold standard for BRAF mutation analysis is molecular biology (DNA-based) techniques, with classical Sanger sequencing being the most commonly used [41,42]. However, in the case of CNS tumors, in particular ganglioglioma, the relevance of the latter method has recently been questioned [40,43], due to its high detection threshold of 20% allele frequency, which does not work well with a low number and/or scattered distribution of neoplastic cells [44]. Although other molecular biology techniques, such as pyrosequencing, next-generation sequencing, PNA-clamping PCR, ddPCR, and ASqPCR, have higher sensitivity (detection of 0.02-10% mutant in a background of wild type), all DNA-based methods are often expensive, labor-intensive, time-consuming, and not widely available in pathology laboratories, also because they require a highly skilled operator and complex infrastructure [45,46].…”
Section: Discussionmentioning
confidence: 99%
“…The output is typically in the Variant Call Format (VCF) so as to be represented in structured text-based formats. 17 The RePlow method can detect somatic variants by estimations of the background and variant models for a given observation. Furthermore, to determine high-confidence variants, we performed some filtering steps based on depths (total depth ≥15, alternative allele depth ≥5) and functional impact (e.g., synonymous or nonsynonymous).…”
Section: Discussionmentioning
confidence: 99%
“…16 To detect somatic mutations among all high-confidence variants, we used the RePlow method that jointly analyzes library-level replicates for accurate detection of low-VAF somatic mutations. 17 The RePlow method can detect somatic variants by estimations of the background and variant models for a given observation. Copy number alterations (CNAs) were called by the CNV detection tool of the GATK4 beta version.…”
Section: Discussionmentioning
confidence: 99%