Solanum pennellii is a wild tomato species endemic to Andean regions in South America, where it has evolved to thrive in arid habitats. Because of its extreme stress tolerance and unusual morphology, it is an important donor of germplasm for the cultivated tomato Solanum lycopersicum 1 . Introgression lines (ILs) in which large genomic regions of S. lycopersicum are replaced with the corresponding segments from S. pennellii can show remarkably superior agronomic performance 2 . Here we describe a high-quality genome assembly of the parents of the IL population. By anchoring the S. pennellii genome to the genetic map, we define candidate genes for stress tolerance and provide evidence that transposable elements had a role in the evolution of these traits. Our work paves a path toward further tomato improvement and for deciphering the mechanisms underlying the myriad other agronomic traits that can be improved with S. pennellii germplasm.Crosses between distantly related plants can lead to substantial improvements in performance. Notably, S. pennellii × S. lycopersicum ILs have been used to define numerous quantitative trait loci (QTLs) for superior yield, chemical composition, morphology, abiotic stress tolerance and extreme heterosis 3,4 . Although genetic studies have proven informative, few genes underlying specific QTLs have been cloned, largely because of the lack of a S. pennellii genome sequence. To support QTL analyses, we sequenced the genome of S. pennellii using Illumina sequencing with ~190-fold coverage ( Fig. 1 and Supplementary Tables 1-5). The initial assembly size was 942 Mb, with a scaffold N50 value of 1.7 Mb and N90 value of 0.43 Mb (Table 1 and Supplementary Tables 6 and 7). We estimated the total genome size to be about 1.2 Gb using a k-mer-based analysis ( Supplementary Fig. 1 and Supplementary Table 8), in accordance with previous estimations 3,4 . We anchored 97.1% of the genome assembly to chromosomes using genetic maps and restriction site-associated DNA sequencing (RAD-seq)-based markers from the IL population 5 (Supplementary Note). Comparison of the assembly to publicly available BAC sequences indicated an accuracy of >99.9%, and a satisfactory accuracy of gap-filled regions was shown by realigning reads (Supplementary Fig. 2 and Supplementary Table 9). Of the 307,350 S. lycopersicum and 7,812 S. pennellii publicly available ESTs, 93% and >96% could be aligned to the genome, respectively (Supplementary Table 10), indicating comprehensive coverage of the gene-rich regions. We predicted 32,273 high-confidence genes and a potential set of 44,966 protein-coding genes and checked these
Although applied over extremely short timescales, artificial selection has dramatically altered the form, physiology, and life history of cultivated plants. We have used RNAseq to define both gene sequence and expression divergence between cultivated tomato and five related wild species. Based on sequence differences, we detect footprints of positive selection in over 50 genes. We also document thousands of shifts in gene-expression level, many of which resulted from changes in selection pressure. These rapidly evolving genes are commonly associated with environmental response and stress tolerance. The importance of environmental inputs during evolution of gene expression is further highlighted by large-scale alteration of the light response coexpression network between wild and cultivated accessions. Human manipulation of the genome has heavily impacted the tomato transcriptome through directed admixture and by indirectly favoring nonsynonymous over synonymous substitutions. Taken together, our results shed light on the pervasive effects artificial and natural selection have had on the transcriptomes of tomato and its wild relatives.domestication | biotic stress | abiotic stress D omestication has long served as an important example of severe phenotypic divergence in response to selection. Darwin recognized the parallel between the processes of domestication and adaptation in the wild and used this analogy to emphasize the power of selection in generating phenotypic diversity (1). The genetic basis of domestication-associated phenotypes has been examined in several instances, most notably in maize, rice, tomato, and dogs (reviewed in refs. 2-5). The clear conclusion from these studies is that the rapid phenotypic divergence associated with domestication is often attributable to very few genetic loci (6). Improvements to DNA sequence technologies have allowed studies of the effect of domestication at the whole-genome level. Early population genetic analyses in maize found that very few genes (∼5%) show evidence of positive selection during domestication of maize (7), and recent work using whole-genome resequencing has found a similar proportion of the genome was under positive selection (8). Evidence for strong selective sweeps at a limited number of loci has also been found in rice and dog genomes (9). Together with the previous genetic mapping work, these studies support the model that relatively few mutations experienced extremely strong selection by humans during domestication.Although not the target of direct positive selection, the rest of the genome still experiences a dramatic shift in evolutionary pressures during domestication. Most characterized domestication events are associated with an extreme genetic bottleneck and alleviation of selective constraints in the original niche (10). These factors are predicted to increase the relative rate of nonsynonymous to synonymous (dN/dS) substitution, potentially resulting in the fixation of deleterious alleles (11). Previous studies comparing the distribution ...
Introgression lines (ILs), in which genetic material from wild tomato species is introgressed into a domesticated background, have been used extensively in tomato (Solanum lycopersicum) improvement. Here, we genotype an IL population derived from the wild desert tomato Solanum pennellii at ultrahigh density, providing the exact gene content harbored by each line. To take advantage of this information, we determine IL phenotypes for a suite of vegetative traits, ranging from leaf complexity, shape, and size to cellular traits, such as stomatal density and epidermal cell phenotypes. Elliptical Fourier descriptors on leaflet outlines provide a global analysis of highly heritable, intricate aspects of leaf morphology. We also demonstrate constraints between leaflet size and leaf complexity, pavement cell size, and stomatal density and show independent segregation of traits previously assumed to be genetically coregulated. Meta-analysis of previously measured traits in the ILs shows an unexpected relationship between leaf morphology and fruit sugar levels, which RNA-Seq data suggest may be attributable to genetically coregulated changes in fruit morphology or the impact of leaf shape on photosynthesis. Together, our results both improve upon the utility of an important genetic resource and attest to a complex, genetic basis for differences in leaf morphology between natural populations.
Significance Ever since Darwin’s pioneering research, a major challenge in biology has been to understand the genetic basis of morphological evolution. Utilizing the natural variation in leaf morphology between tomato and two related wild species, we identified a gene network module that leads to a dynamic rewiring of interactions in the whole leaf developmental gene regulatory network. Our work experimentally validates the hypothesis that peripheral regions of network, rather than network hubs, are more likely to contribute to evolutionary innovations. Our data also suggest that, likely due to their bottleneck location in the network, the regulation in KNOX homeobox genes was repeatedly manipulated to generate natural variation in leaf shape.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.