Background Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. Findings We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (∼211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. Conclusion This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics.
The botanical family Cucurbitaceae includes a variety of fruit crops with global or local economic importance. How their genomes evolve and the genetic basis of diversity remain largely unexplored. In this study, we sequence the genome of the wax gourd (Benincasa hispida), which bears giant fruit up to 80 cm in length and weighing over 20 kg. Comparative analyses of six cucurbit genomes reveal that the wax gourd genome represents the most ancestral karyotype, with the predicted ancestral genome having 15 proto-chromosomes. We also resequence 146 lines of diverse germplasm and build a variation map consisting of 16 million variations. Combining population genetics and linkage mapping, we identify a number of regions/genes potentially selected during domestication and improvement, some of which likely contribute to the large fruit size in wax gourds. Our analyses of these data help to understand genome evolution and function in cucurbits.
Summary The diploid wild cotton species Gossypium australe possesses excellent traits including resistance to disease and delayed gland morphogenesis, and has been successfully used for distant breeding programmes to incorporate disease resistance traits into domesticated cotton. Here, we sequenced the G. australe genome by integrating PacBio, Illumina short read, BioNano (DLS) and Hi‐C technologies, and acquired a high‐quality reference genome with a contig N50 of 1.83 Mb and a scaffold N50 of 143.60 Mb. We found that 73.5% of the G. australe genome is composed of various repeat sequences, differing from those of G. arboreum (85.39%), G. hirsutum (69.86%) and G. barbadense (69.83%). The G. australe genome showed closer collinear relationships with the genome of G. arboreum than G. raimondii and has undergone less extensive genome reorganization than the G. arboreum genome. Selection signature and transcriptomics analyses implicated multiple genes in disease resistance responses, including GauCCD7 and GauCBP1, and experiments revealed induction of both genes by Verticillium dahliae and by the plant hormones strigolactone (GR24), salicylic acid (SA) and methyl jasmonate (MeJA). Experiments using a Verticillium‐resistant domesticated G. barbadense cultivar confirmed that knockdown of the homologues of these genes caused a significant reduction in resistance against Verticillium dahliae. Moreover, knockdown of a newly identified gland‐associated gene GauGRAS1 caused a glandless phenotype in partial tissues using G. australe. The G. australe genome represents a valuable resource for cotton research and distant relative breeding as well as for understanding the evolutionary history of crop genomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.