Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat 1 (Triticum aestivum, genomes AABBDD) and an important genetic resource for wheat [2][3][4] . The large size and highly repetitive nature of the Ae. tauschii genome has until now precluded the development of a reference-quality genome sequence 5 .Here we use an array of advanced technologies, including orderedclone genome sequencing, whole-genome shotgun sequencing, and BioNano optical genome mapping, to generate a referencequality genome sequence for Ae. tauschii ssp. strangulata accession AL8/78, which is closely related to the wheat D genome. We show that compared to other sequenced plant genomes, including a much larger conifer genome, the Ae. tauschii genome contains unprecedented amounts of very similar repeated sequences. Our genome comparisons reveal that the Ae. tauschii genome has a greater number of dispersed duplicated genes than other sequenced genomes and its chromosomes have been structurally evolving an order of magnitude faster than those of other grass genomes.
The current limitations in genome sequencing technology require the construction of physical maps for high-quality draft sequences of large plant genomes, such as that of Aegilops tauschii, the wheat D-genome progenitor. To construct a physical map of the Ae. tauschii genome, we fingerprinted 461,706 bacterial artificial chromosome clones, assembled contigs, designed a 10K Ae. tauschii Infinium SNP array, constructed a 7,185-marker genetic map, and anchored on the map contigs totaling 4.03 Gb. Using whole genome shotgun reads, we extended the SNP marker sequences and found 17,093 genes and gene fragments. We showed that collinearity of the Ae. tauschii genes with Brachypodium distachyon, rice, and sorghum decreased with phylogenetic distance and that structural genome evolution rates have been high across all investigated lineages in subfamily Pooideae, including that of Brachypodieae. We obtained additional information about the evolution of the seven Triticeae chromosomes from 12 ancestral chromosomes and uncovered a pattern of centromere inactivation accompanying nested chromosome insertions in grasses. We showed that the density of noncollinear genes along the Ae. tauschii chromosomes positively correlates with recombination rates, suggested a cause, and showed that new genes, exemplified by disease resistance genes, are preferentially located in high-recombination chromosome regions. (2), and 90% of its genome was estimated to be repetitive DNA (3). The Ae. tauschii genome and the D genome of hexaploid wheat are closely related due to the recent origin of hexaploid wheat (4). Ae. tauschii is therefore an important resource for wheat breeding, and its genome is an invaluable reference for wheat genomics, as illustrated by the utility of its sequences in the analysis of the wheat gene space (5). The utility of Ae. tauschii for wheat genetics and genomics would be further enhanced by a high-quality draft sequence of its genome. With current technology, the only approach to produce a high-quality de novo draft sequence for a genome of this size and complexity is the orderedclone sequencing approach, which requires a physical map.Physical map construction necessitates fingerprinting multiple genome equivalents of bacterial artificial chromosome (BAC) clones, assembling them into contigs, and anchoring the contigs on a genetic map (6-8). Great strides have been made in BAC fingerprinting techniques (7, 9-12) and software for fingerprint editing and contig assembly (13-16). It is now possible with these technological advances to fingerprint and assemble contigs from hundreds of thousands of BAC clones (7,8,(17)(18)(19). In contrast, contig anchoring remains a weakness in physical mapping of large plant genomes because of their low gene density, extensive gene duplication, and abundance of repetitive DNA. BAC end sequences (BESs) are an effective means of contig anchoring in small genomes (11). In large genomes, however, hundreds of thousands of BESs are needed. DNA hybridization and PCRbased anchoring (6,7,20,21)...
Until recently, achieving a reference-quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5-Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high-confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.
Wheat stem rust, caused by the fungus Puccinia graminis f. sp. tritici, afflicts bread wheat (Triticum aestivum). New virulent races collectively referred to as "Ug99" have emerged, which threaten global wheat production. The wheat gene Sr33, introgressed from the wild relative Aegilops tauschii into bread wheat, confers resistance to diverse stem rust races, including the Ug99 race group. We cloned Sr33, which encodes a coiled-coil, nucleotide-binding, leucine-rich repeat protein. Sr33 is orthologous to the barley (Hordeum vulgare) Mla mildew resistance genes that confer resistance to Blumeria graminis f. sp. hordei. The wheat Sr33 gene functions independently of RAR1, SGT1, and HSP90 chaperones. Haplotype analysis from diverse collections of Ae. tauschii placed the origin of Sr33 resistance near the southern coast of the Caspian Sea.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.