Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat [1] (Triticum aestivum, genomes AABBDD) and an important genetic resource for wheat [2][3][4]. The large size and highly repetitive nature of the Ae. tauschii genome have until now precluded the development of a reference-quality genome sequence [5]. Here we use an array of advanced technologies, including ordered-clone genome sequencing, whole-genome shotgun sequencing, and BioNano optical genome mapping, to generate a reference-quality genome sequence for Ae. tauschii ssp. strangulata accession AL8/78, which is closely related to the wheat D genome. We show that, compared to other sequenced plant genomes, including a much larger conifer genome, the Ae. tauschii genome contains unprecedented amounts of very similar repeated sequences. Our genome comparisons reveal that the Ae. tauschii genome has a greater number of dispersed duplicated genes than other sequenced genomes and that its chromosomes have been structurally evolving an order of magnitude faster than those of other grass genomes.
Background: The size and complexity of conifer genomes have, until now, prevented full genome sequencing and assembly. The large research community and economic importance of loblolly pine, Pinus taeda L., made it an early candidate for reference sequence determination.
Results: We develop a novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology. We use a whole-genome shotgun approach relying primarily on next-generation sequence generated from a single haploid seed megagametophyte from a loblolly pine tree, 20-1010, that has been used in industrial forest tree breeding. The resulting sequence and assembly were used to generate a draft genome spanning 23.2 Gbp and containing 20.1 Gbp with an N50 scaffold size of 66.9 kbp, making it a significant improvement over available conifer genomes. The long scaffold lengths allow the annotation of 50,172 gene models with intron lengths averaging over 2.7 kbp and sometimes exceeding 100 kbp. Analysis of orthologous gene sets identifies gene families that may be unique to conifers. We further characterize and expand the existing repeat library based on the de novo analysis of the repetitive content, estimated to encompass 82% of the genome.
Conclusions: In addition to its value as a resource for researchers and breeders, the loblolly pine genome sequence and assembly reported here demonstrate a novel approach to sequencing the large and complex genomes of this important group of plants that can now be widely applied.
Because of the huge size of the common wheat (Triticum aestivum L., 2n = 6x = 42, AABBDD) genome of 17,300 Mb, sequencing and mapping of the expressed portion is a logical first step for gene discovery. Here we report mapping of 7104 expressed sequence tag (EST) unigenes by Southern hybridization into a chromosome bin map using a set of wheat aneuploids and deletion stocks. Each EST detected a mean of 4.8 restriction fragments and 2.8 loci. More loci were mapped in the B genome (5774) than in the A (5173) or D (5146) genomes. The EST density was significantly higher for the D genome than for the A or B. In general, EST density increased relative to the physical distance from the centromere. The majority of EST-dense regions are in the distal parts of chromosomes. Most of the agronomically important genes are located in EST-dense regions. The chromosome bin map of ESTs is a unique resource for SNP analysis, comparative mapping, structural and functional analysis, and polyploid evolution, as well as providing a framework for constructing a sequence-ready, BAC-contig map of the wheat genome.