OPENRosaceae is the most important fruit-producing clade, and its key commercially relevant genera (Fragaria, Rosa, Rubus and Prunus) show broadly diverse growth habits, fruit types and compact diploid genomes. Peach, a diploid Prunus species, is one of the best genetically characterized deciduous trees. Here we describe the high-quality genome sequence of peach obtained from a completely homozygous genotype. We obtained a complete chromosome-scale assembly using Sanger whole-genome shotgun methods. We predicted 27,852 protein-coding genes, as well as noncoding RNAs. We investigated the path of peach domestication through whole-genome resequencing of 14 Prunus accessions. The analyses suggest major genetic bottlenecks that have substantially shaped peach genome diversity. Furthermore, comparative analyses showed that peach has not undergone recent whole-genome duplication, and even though the ancestral triplicated blocks in peach are fragmentary compared to those in grape, all seven paleosets of paralogs from the putative paleoancestor are detectable.
BackgroundThe availability of the peach genome sequence has fostered relevant research in peach and related Prunus species enabling the identification of genes underlying important horticultural traits as well as the development of advanced tools for genetic and genomic analyses. The first release of the peach genome (Peach v1.0) represented a high-quality WGS (Whole Genome Shotgun) chromosome-scale assembly with high contiguity (contig L50 214.2 kb), large portions of mapped sequences (96%) and high base accuracy (99.96%). The aim of this work was to improve the quality of the first assembly by increasing the portion of mapped and oriented sequences, correcting misassemblies and improving the contiguity and base accuracy using high-throughput linkage mapping and deep resequencing approaches.ResultsFour linkage maps with 3,576 molecular markers were used to improve the portion of mapped and oriented sequences (from 96.0% and 85.6% of Peach v1.0 to 99.2% and 98.2% of v2.0, respectively) and enabled a more detailed identification of discernible misassemblies (10.4 Mb in total). The deep resequencing approach fixed 859 homozygous SNPs (Single Nucleotide Polymorphisms) and 1347 homozygous indels. Moreover, the assembled NGS contigs enabled the closing of 212 gaps with an improvement in the contig L50 of 19.2%.ConclusionsThe improved high quality peach genome assembly (Peach v2.0) represents a valuable tool for the analysis of the genetic diversity, domestication, and as a vehicle for genetic improvement of peach and related Prunus species. Moreover, the important phylogenetic position of peach and the absence of recent whole genome duplication (WGD) events make peach a pivotal species for comparative genomics studies aiming at elucidating plant speciation and diversification processes.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3606-9) contains supplementary material, which is available to authorized users.
Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs.The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species.
Peach flesh color (white or yellow) is among the most popular commercial criteria for peach classification, and has implications for consumer acceptance and fruit nutritional quality. Despite the increasing interest in improving cultivars of both flesh types, little is known about the genetic basis for the carotenoid content diversity in peach. Here we describe the association between genotypes at a locus encoding the carotenoid cleavage dioxygenase 4 (PpCCD4), localized in pseudomolecule 1 of the Prunus persica reference genome sequence, and the flesh color for 37 peach varieties, including two somatic revertants, and three ancestral relatives of peach, providing definitive evidence that this locus is responsible for flesh color phenotype. We show that yellow peach alleles have arisen from various ancestral haplotypes by at least three independent mutational events involving nucleotide substitutions, small insertions and transposable element insertions, and that these mutations, despite being located within the transcribed portion of the gene, also result in marked differences in transcript levels, presumably as a consequence of differential transcript stability involving nonsense-mediated mRNA decay. The PpCCD4 gene provides a unique example of a gene for which humans, in their quest to diversify phenotypic appearance and qualitative characteristics of a fruit, have been able to select and exploit multiple mutations resulting from a variety of mechanisms.
BackgroundMADS-box genes encode a family of eukaryotic transcription factors distinguished by the presence of a highly-conserved ~58 amino acid DNA-binding and dimerization domain (the MADS-box). The central role played by MADS-box genes in peach endodormancy regulation led us to examine this large gene family in more detail. We identified the locations and sequences of 79 MADS-box genes in peach, separated them into established subfamilies, and broadly surveyed their tissue-specific and dormancy-induced expression patterns using next-generation sequencing. We then focused on the dormancy-related SVP/AGL24 and FLC subfamilies, comparing their numbers and phylogenetic relationships with those of other sequenced woody perennial genomes.ResultsWe identified 79 MADS-box genes distributed across all eight peach chromosomes and frequently located in clusters of two or more genes. They encode proteins with a mean length of 248 ± 72 amino acids and include representatives from most of the thirteen Type II (MIKC) subfamilies, as well as members of the Type I Mα, Mβ, and Mγ subfamilies. Most Type I genes were present in species-specific monophyletic lineages, and their expression in the peach sporophyte was low or absent. Most Type II genes had Arabidopsis orthologs and were expressed at much higher levels throughout vegetative and fruit tissues. During short-day-induced growth cessation, seven Type II genes from the SVP/AGL24, AGL17, and SEP subfamilies showed significant changes in expression. Phylogenetic analyses indicated that multiple, independent expansions have taken place within the SVP/AGL24 and FLC lineages in woody perennial species.ConclusionsMost Type I genes appear to have arisen through tandem duplications after the divergence of the Arabidopsis and peach lineages, whereas Type II genes appear to have increased following whole genome duplication events. An exception to the latter rule occurs in the FLC and SVP/AGL24 Type II subfamilies, in which species-specific tandem duplicates have been retained in a number of perennial species. These subfamilies comprise part of a genetic toolkit that regulates endodormancy transitions, but phylogenetic and expression data suggest that individual orthologs may not function identically across all species.Electronic supplementary materialThe online version of this article (doi:10.1186/s12870-015-0436-2) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.