The association of genetic variation with disease and drug response, and improvements in nucleic acid technologies, have given great optimism for the impact of 'genomic medicine'. However, the formidable size of the diploid human genome, approximately 6 gigabases, has prevented the routine application of sequencing methods to deciphering complete individual human genomes. To realize the full potential of genomics for human health, this limitation must be overcome. Here we report the DNA sequence of a diploid genome of a single individual, James D. Watson, sequenced to 7.4-fold redundancy in two months using massively parallel sequencing in picolitre-size reaction vessels. This sequence was completed in two months at approximately one-hundredth of the cost of traditional capillary electrophoresis methods. Comparison of the sequence to the reference genome led to the identification of 3.3 million single nucleotide polymorphisms, of which 10,654 cause amino-acid substitution within the coding sequence. In addition, we accurately identified small-scale (2-40,000 base pair (bp)) insertion and deletion polymorphism as well as copy number variation resulting in the large-scale gain and loss of chromosomal segments ranging from 26,000 to 1.5 million base pairs. Overall, these results agree well with recent results of sequencing of a single individual by traditional methods. However, in addition to being faster and significantly less expensive, this sequencing technology avoids the arbitrary loss of genomic sequences inherent in random shotgun sequencing by bacterial cloning because it amplifies DNA in a cell-free system. As a result, we further demonstrate the acquisition of novel human sequence, including novel genes not previously identified by traditional genomic sequencing. This is the first genome sequenced by next-generation technologies. Therefore it is a pilot for the future challenges of 'personalized genome sequencing'.
The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.
INTRODUCTION The Saccharomyces cerevisiae 2.0 project (Sc2.0) aims to modify the yeast genome with a series of densely spaced designer changes. Both a synthetic yeast chromosome arm (synIXR) and the entirely synthetic chromosome (synIII) function with high fitness in yeast. For designer genome synthesis projects, precise engineering of the physical sequence to match the specified design is important for the systematic evaluation of underlying design principles. Yeast can maintain nuclear chromosomes as rings, occurring by chance at repeated sequences, although the cyclized format is unfavorable in meiosis given the possibility of dicentric chromosome formation from meiotic recombination. Here, we describe the de novo synthesis of synthetic yeast chromosome V (synV) in the “Build-A-Genome China” course, perfectly matching the designer sequence and bearing loxPsym sites, distinguishable watermarks, and all the other features of the synthetic genome. We generated a ring synV derivative with user-specified cyclization coordinates and characterized its performance in mitosis and meiosis. RATIONALE Systematic evaluation of underlying Sc2.0 design principles requires that the final assembled synthetic genome perfectly match the designed sequence. Given the size of yeast chromosomes, synthetic chromosome construction is performed iteratively, and new mutations and unpredictable events may occur during synthesis; even a very small number of unintentional nucleotide changes across the genome could have substantial effects on phenotype. Therefore, precisely matching the physical sequence to the designed sequence is crucial for verification of the design principles in genome synthesis. Ring chromosomes can extend those design principles to provide a model for genomic rearrangement, ring chromosome evolution, and human ring chromosome disorders. RESULTS We chemically synthesized, assembled, and incorporated designer chromosome synV (536,024 base pairs) of S. cerevisiae according to Sc2.0 principles, based on the complete nucleotide sequence of native yeast chromosome V (576,874 base pairs). This work was performed as part of the “Build-A-Genome China” course in Tianjin University. We corrected all mutations found—including duplications, substitutions, and indels—in the initial synV strain by using integrative cotransformation of the precise desired changes and by means of a clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 (Cas9)–based method. Altogether, 3331 corrected base pairs were required to match to the designed sequence. We generated a strain that exactly matches all designer sequence changes that displays high fitness under a variety of culture conditions. All corrections were verified with whole-genome sequencing; RNA sequencing revealed only minor changes in gene expression—most notably, decreases in expression of genes relocated near synthetic telomeres as a result of design. We constructed a functional circular synV (ring_synV) derivative in yeast by precisely joining both chromosome ends (telomeres) at specified coordinates. The ring chromosome showed restoration of subtelomeric gene expression levels. The ring_synV strain exhibited fitness comparable with that of the linear synV strain, revealed no change in sporulation frequency, but notably reduced spore viability. In meiosis, heterozygous or homozygous diploid ring_wtV and ring_synV chromosomes behaved similarly, exhibiting substantially higher frequency of the formation of zero-spore tetrads, a type that was not seen in the rod chromosome diploids. Rod synV chromosomes went through meiosis with high spore viability, despite no effort having been made to preserve meiotic competency in the design of synV. CONCLUSION The perfect designer-matched synthetic chromosome V provides strategies to edit sequence variants and correct unpredictable events, such as off-target integration of extra copies of synthetic DNA elsewhere in the genome. We also constructed a ring synthetic chromosome derivative and evaluated its fitness and stability in yeast. Both synV and synVI can be circularized and can power yeast cell growth without affecting fitness when gene content is maintained. These fitness and stability phenotypes of the ring synthetic chromosome in yeast provide a model system with which to probe the mechanism of human ring chromosome disorders. Synthesis, cyclization, and characterization of synV . ( A ) Synthetic chromosome V (synV, 536,024 base pairs) was designed in silico from native chromosome V (wtV, 576,874 base pairs), with extensive genotype modification designed to be phenotypically neutral. ( B ) CRISPR/Cas9 strategy for multiplex repair. ( C ) Colonies of wtV, synV, and ring_synV strains.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.