Next-generation DNA sequencing promises to revolutionize clinical medicine and basic research. However, while this technology has the capacity to generate hundreds of billions of nucleotides of DNA sequence in a single experiment, the error rate of ∼1% results in hundreds of millions of sequencing mistakes. These scattered errors can be tolerated in some applications but become extremely problematic when "deep sequencing" genetically heterogeneous mixtures, such as tumors or mixed microbial populations. To overcome limitations in sequencing accuracy, we have developed a method termed Duplex Sequencing. This approach greatly reduces errors by independently tagging and sequencing each of the two strands of a DNA duplex. As the two strands are complementary, true mutations are found at the same position in both strands. In contrast, PCR or sequencing errors result in mutations in only one strand and can thus be discounted as technical error. We determine that Duplex Sequencing has a theoretical background error rate of less than one artifactual mutation per billion nucleotides sequenced. In addition, we establish that detection of mutations present in only one of the two strands of duplex DNA can be used to identify sites of DNA damage. We apply the method to directly assess the frequency and pattern of random mutations in mitochondrial DNA from human cells.cancer | diagnostics | subclone | quasispecies | biomarker
Duplex Sequencing (DS) is a next-generation sequencing methodology capable of detecting a single mutation among >1 × 107 wild-type nucleotides, thereby enabling the study of heterogeneous populations and very-low-frequency genetic alterations. DS can be applied to any double-stranded DNA sample, but it is ideal for small genomic regions of <1 Mb in size. The method relies on the ligation of sequencing adapters harboring random yet complementary double-stranded nucleotide sequences to the sample DNA of interest. Individually labeled strands are then PCR-amplified, creating sequence ‘families’ that share a common tag sequence derived from the two original complementary strands. Mutations are scored only if the variant is present in the PCR families arising from both of the two DNA strands. Here we provide a detailed protocol for efficient DS adapter synthesis, library preparation and target enrichment, as well as an overview of the data analysis workflow. The protocol typically takes 1–3 d.
Mitochondrial DNA (mtDNA) is believed to be highly vulnerable to age-associated damage and mutagenesis by reactive oxygen species (ROS). However, somatic mtDNA mutations have historically been difficult to study because of technical limitations in accurately quantifying rare mtDNA mutations. We have applied the highly sensitive Duplex Sequencing methodology, which can detect a single mutation among >107 wild type molecules, to sequence mtDNA purified from human brain tissue from both young and old individuals with unprecedented accuracy. We find that the frequency of point mutations increases ∼5-fold over the course of 80 years of life. Overall, the mutation spectra of both groups are comprised predominantly of transition mutations, consistent with misincorporation by DNA polymerase γ or deamination of cytidine and adenosine as the primary mutagenic events in mtDNA. Surprisingly, G→T mutations, considered the hallmark of oxidative damage to DNA, do not significantly increase with age. We observe a non-uniform, age-independent distribution of mutations in mtDNA, with the D-loop exhibiting a significantly higher mutation frequency than the rest of the genome. The coding regions, but not the D-loop, exhibit a pronounced asymmetric accumulation of mutations between the two strands, with G→A and T→C mutations occurring more often on the light strand than the heavy strand. The patterns and biases we observe in our data closely mirror the mutational spectrum which has been reported in studies of human populations and closely related species. Overall our results argue against oxidative damage being a major driver of aging and suggest that replication errors by DNA polymerase γ and/or spontaneous base hydrolysis are responsible for the bulk of accumulating point mutations in mtDNA.
Mutations, the fuel of evolution, are first manifested as rare DNA changes within a population of cells. Although next-generation sequencing (NGS) technologies have revolutionized the study of genomic variation between species and individual organisms, most have limited ability to accurately detect and quantify rare variants among the different genome copies in heterogeneous mixtures of cells or molecules. We describe the technical challenges in characterizing subclonal variants using conventional NGS protocols and the recent development of error correction strategies, both computational and experimental, including consensus sequencing of single DNA molecules. We also highlight major applications for low-frequency mutation detection in science and medicine, describe emerging methodologies and provide our vision for the future of DNA sequencing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.