DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally employed long (400–800 bp) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intra-species genetic variation. We report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30x average depth of paired 35-base reads. We characterise four million SNPs and four hundred thousand structural variants, many of which are previously unknown. Our approach is effective for accurate, rapid and economical whole genome re-sequencing and many other biomedical applications.
We report here on the genome sequence of Pasteurella multocida Razi 0002 of avian origin, isolated in Iran. The genome has a size of 2,289,036 bp, a GC content of 40.3%, and is predicted to contain 2,079 coding sequences.
Ectodermal dysplasias comprise over 150 syndromes of unknown pathogenesis. X-linked anhidrotic ectodermal dysplasia (EDA) is characterized by abnormal hair, teeth and sweat glands. We now describe the positional cloning of the gene mutated in EDA. Two exons, separated by a 200-kilobase intron, encode a predicted 135-residue transmembrane protein. The gene is disrupted in six patients with X;autosome translocations or submicroscopic deletions; nine patients had point mutations. The gene is expressed in keratinocytes, hair follicles, and sweat glands, and in other adult and fetal tissues. The predicted EDA protein may belong to a novel class with a role in epithelial-mesenchymal signalling.
Simpson-Golabi-Behmel syndrome (SGBS) is an X-linked condition characterized by pre- and postnatal overgrowth with visceral and skeletal anomalies. To identify the causative gene, breakpoints in two female patients with X;autosome translocations were identified. The breakpoints occur near the 5' and 3' ends of a gene, GPC3, that spans more than 500 kilobases in Xq26; in three families, different microdeletions encompassing exons cosegregate with SGBS. GPC3 encodes a putative extracellular proteoglycan, glypican 3, that is inferred to play an important role in growth control in embryonic mesodermal tissues in which it is selectively expressed. Initial western- and ligand-blotting experiments suggest that glypican 3 forms a complex with insulin-like growth factor 2 (IGF2), and might thereby modulate IGF2 action.
A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives, however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size. This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.