Allotetraploid cotton is an economically important natural-fiber-producing crop worldwide. After polyploidization, Gossypium hirsutum L. evolved to produce a higher fiber yield and to better survive harsh environments than Gossypium barbadense, which produces superior-quality fibers. The global genetic and molecular bases for these interspecies divergences were unknown. Here we report high-quality de novo-assembled genomes for these two cultivated allotetraploid species with pronounced improvement in repetitive-DNA-enriched centromeric regions. Whole-genome comparative analyses revealed that speciesspecific alterations in gene expression, structural variations and expanded gene families were responsible for speciation and the evolutionary history of these species. These findings help to elucidate the evolution of cotton genomes and their domestication history. The information generated not only should enable breeders to improve fiber quality and resilience to ever-changing environmental conditions but also can be translated to other crops for better understanding of their domestication history and use in improvement.
SummaryNatural antisense transcripts (NATs) are commonly observed in eukaryotic genomes, but only a limited number of such genes have been identified as being involved in gene regulation in plants. In this research, we investigated the function of small RNA derived from a NAT in fiber cell development.Using a map-based cloning strategy for the first time in tetraploid cotton, we cloned a naked seed mutant gene (N 1 ) encoding a MYBMIXTA-like transcription factor 3 (MML3)/GhMYB25-like in chromosome A12, GhMML3_A12, that is associated with fuzz fiber development.The extremely low expression of GhMML3_A12 in N 1 is associated with NAT production, driven by its 3 0 antisense promoter, as indicated by the promoter-driven histochemical staining assay. In addition, small RNA deep sequencing analysis suggested that the bidirectional transcriptions of GhMML3_A12 form double-stranded RNAs and generate 21-22 nt small RNAs. Therefore, in a fiber-specific manner, small RNA derived from the GhMML3_A12 locus can mediate GhMML3_A12 mRNA self-cleavage and result in the production of naked seeds followed by lint fiber inhibition in N 1 plants. The present research reports the first observation of gene-mediated NATs and siRNA directly controlling fiber development in cotton.
BackgroundCotton has been cultivated and used to make fabrics for at least 7000 years. Two allotetraploid species of great commercial importance, Gossypium hirsutum and Gossypium barbadense, were domesticated after polyploidization and are cultivated worldwide. Although the overall genetic diversity between these two cultivated species has been studied with limited accessions, their population structure and genetic variations remain largely unknown.ResultsWe resequence the genomes of 147 cotton accessions, including diverse wild relatives, landraces, and modern cultivars, and construct a comprehensive variation map to provide genomic insights into the divergence and dual domestication of these two important cultivated tetraploid cotton species. Phylogenetic analysis shows two divergent groups for G. hirsutum and G. barbadense, suggesting a dual domestication processes in tetraploid cottons. In spite of the strong genetic divergence, a small number of interspecific reciprocal introgression events are found between these species and the introgression pattern is significantly biased towards the gene flow from G. hirsutum into G. barbadense. We identify selective sweeps, some of which are associated with relatively highly expressed genes for fiber development and seed germination.ConclusionsWe report a comprehensive analysis of the evolution and domestication history of allotetraploid cottons based on the whole genomic variation between G. hirsutum and G. barbadense and between wild accessions and modern cultivars. These results provide genomic bases for improving cotton production and for further evolution analysis of polyploid crops.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-017-1167-5) contains supplementary material, which is available to authorized users.
BackgroundSNPs are the most abundant polymorphism type, and have been explored in many crop genomic studies, including rice and maize. SNP discovery in allotetraploid cotton genomes has lagged behind that of other crops due to their complexity and polyploidy. In this study, genome-wide SNPs are detected systematically using next-generation sequencing and efficient SNP genotyping methods, and used to construct a linkage map and characterize the structural variations in polyploid cotton genomes.ResultsWe construct an ultra-dense inter-specific genetic map comprising 4,999,048 SNP loci distributed unevenly in 26 allotetraploid cotton linkage groups and covering 4,042 cM. The map is used to order tetraploid cotton genome scaffolds for accurate assembly of G. hirsutum acc. TM-1. Recombination rates and hotspots are identified across the cotton genome by comparing the assembled draft sequence and the genetic map. Using this map, genome rearrangements and centromeric regions are identified in tetraploid cotton by combining information from the publicly-available G. raimondii genome with fluorescent in situ hybridization analysis.ConclusionsWe report the genotype-by-sequencing method used to identify millions of SNPs between G. hirsutum and G. barbadense. We construct and use an ultra-dense SNP map to correct sequence mis-assemblies, merge scaffolds into pseudomolecules corresponding to chromosomes, detect genome rearrangements, and identify centromeric regions in allotetraploid cottons. We find that the centromeric retro-element sequence of tetraploid cotton derived from the D subgenome progenitor might have invaded the A subgenome centromeres after allotetrapolyploid formation. This study serves as a valuable genomic resource for genetic research and breeding of cotton.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-015-0678-1) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.