About 8,000 years ago in the Fertile Crescent, a spontaneous hybridization of the wild diploid grass Aegilops tauschii (2n 5 14; DD) with the cultivated tetraploid wheat Triticum turgidum (2n 5 4x 5 28; AABB) resulted in hexaploid wheat (T. aestivum; 2n 5 6x 5 42; AABBDD) 1,2 . Wheat has since become a primary staple crop worldwide as a result of its enhanced adaptability to a wide range of climates and improved grain quality for the production of baker's flour 2 . Here we describe sequencing the Ae. tauschii genome and obtaining a roughly 90-fold depth of short reads from libraries with various insert sizes, to gain a better understanding of this genetically complex plant. The assembled scaffolds represented 83.4% of the genome, of which 65.9% comprised transposable elements. We generated comprehensive RNA-Seq data and used it to identify 43,150 protein-coding genes, of which 30,697 (71.1%) were uniquely anchored to chromosomes with an integrated high-density genetic map. Whole-genome analysis revealed gene family expansion in Ae. tauschii of agronomically relevant gene families that were associated with disease resistance, abiotic stress tolerance and grain quality. This draft genome sequence provides insight into the environmental adaptation of bread wheat and can aid in defining the large and complicated genomes of wheat species.We selected Ae. tauschii accession AL8/78 for genome sequencing because it has been extensively characterized genetically (Supplementary Information). Using a whole genome shotgun strategy, we generated 398 Gb of high-quality reads from 45 libraries with insert sizes ranging from 200 bp to 20 kb (Supplementary Information). A hierarchical, iterative assembly of short reads employing the parallelized sequence assembler SOAPdenovo 3 achieved contigs with an N50 length (minimum length of contigs representing 50% of the assembly) of 4,512 bp (Table 1). Paired-end information combined with an additional 18.4 Gb of Roche/454 long-read sequences was used sequentially to generate 4.23-Gb scaffolds (83.4% were non-gapped contiguous sequences) with an N50 length of 57.6 kb (Supplementary Information). The assembly represented 97% of the 4.36-Gb genome as estimated by K-mer analysis (Supplementary Information). We also obtained 13,185 Ae. tauschii expressed sequence tag (EST) sequences using Sanger sequencing, of which 11,998 (91%) could be mapped to the scaffolds with more than 90% coverage (Supplementary Information).To aid in gene identification, we performed RNA-Seq (53.2 Gb for a 117-Mb transcriptome assembly) on 23 libraries representing eight tissues including pistil, root, seed, spike, stamen, stem, leaf and sheath (Supplementary Information). Using both evidence-based and de novo gene predictions, we identified 34,498 high-confidence protein-coding loci. FGENESH 4 and GeneID models were supported by a 60% overlap with either our ESTs and RNA-Seq reads, or with homologous proteins. More than 76% of the gene models had a significant match (E value # 10 25; alignment length $ 60%) in the ...
The OsGW2 gene is involved in rice grain development, influencing grain width and weight. Its ortholog in wheat, TaGW2, was considered as a candidate gene related to grain development. We found that TaGW2 is constitutively expressed, with three orthologs expressing simultaneously. The coding sequence (CDS) of TaGW2 is 1,275 bp encoding a protein with 424 amino acids, and has a functional domain shared with OsGW2. No divergence was detected within the CDS sequences in the same locus in ten varieties. Genome-specific primers were designed based on the sequence divergence of the promoter regions in the three orthologous genes, and TaGW2 was located in homologous group 6 chromosomes through CS nulli-tetrasomic (NT). Two SNPs were detected in the promoter region of TaGW2-6A, forming two haplotypes: Hap-6A-A (-593A and -739G) and Hap-6A-G (-593G and -769A). A cleaved amplified polymorphic sequence (CAPS) marker was developed based on the -593 A-G polymorphism to distinguish the two haplotypes in TaGW2-6A. This gene was fine mapped 0.6 cM from marker cfd80.2 near the centromere in a recombinant inbred line (RIL) population. Two hundred sixty-five Chinese wheat varieties were genotyped and association analysis revealed that Hap-6A-A was significantly associated with wider grains and higher one-thousand grain weight (TGW) in two crop seasons. qRT-PCR revealed a negative relationship between TaGW2 expression level and grain width. The Hap-6A-A frequencies in Chinese varieties released at different periods showed that it had been strongly positively selected in breeding. In landraces, Hap-6A-A is mainly distributed in southern Chinese wheat regions. Association analysis also indicated that Hap-6A-A not only increased TGW by more than 3 g, but also had earlier heading and maturity. In contrast to Chinese varieties, Hap-6A-G was the predominant haplotype in European varieties; Hap-6A-A was mainly present in varieties released in the former Yugoslavia, Italy, Bulgaria, Hungary and Portugal.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.