Plant metabolism underpins many traits of ecological and agronomic importance. Plants produce numerous compounds to cope with their environments but the biosynthetic pathways for most of these compounds have not yet been elucidated. To engineer and improve metabolic traits, we need comprehensive and accurate knowledge of the organization and regulation of plant metabolism at the genome scale. Here, we present a computational pipeline to identify metabolic enzymes, pathways, and gene clusters from a sequenced genome. Using this pipeline, we generated metabolic pathway databases for 22 species and identified metabolic gene clusters from 18 species. This unified resource can be used to conduct a wide array of comparative studies of plant metabolism. Using the resource, we discovered a widespread occurrence of metabolic gene clusters in plants: 11,969 clusters from 18 species. The prevalence of metabolic gene clusters offers an intriguing possibility of an untapped source for uncovering new metabolite biosynthesis pathways. For example, more than 1,700 clusters contain enzymes that could generate a specialized metabolite scaffold (signature enzymes) and enzymes that modify the scaffold (tailoring enzymes). In four species with sufficient gene expression data, we identified 43 highly coexpressed clusters that contain signature and tailoring enzymes, of which eight were characterized previously to be functional pathways. Finally, we identified patterns of genome organization that implicate local gene duplication and, to a lesser extent, single gene transposition as having played roles in the evolution of plant metabolic gene clusters.
Summary Protein phosphorylation and acetylation are the two most abundant post‐translational modifications (PTMs) that regulate protein functions in eukaryotes. In plants, these PTMs have been investigated individually; however, their co‐occurrence and dynamics on proteins are currently unknown. Using Arabidopsis thaliana, we quantified changes in protein phosphorylation, acetylation and protein abundance in leaf rosettes, roots, flowers, siliques and seedlings at the end of day (ED) and at the end of night (EN). This identified 2549 phosphorylated and 909 acetylated proteins, of which 1724 phosphorylated and 536 acetylated proteins were also quantified for changes in PTM abundance between ED and EN. Using a sequential dual‐PTM workflow, we identified significant PTM changes and intersections in these organs and plant developmental stages. In particular, cellular process‐, pathway‐ and protein‐level analyses reveal that the phosphoproteome and acetylome predominantly intersect at the pathway‐ and cellular process‐level at ED versus EN. We found 134 proteins involved in core plant cell processes, such as light harvesting and photosynthesis, translation, metabolism and cellular transport, that were both phosphorylated and acetylated. Our results establish connections between PTM motifs, PTM catalyzing enzymes and putative substrate networks. We also identified PTM motifs for further characterization of the regulatory mechanisms that control cellular processes during the diurnal cycle in different Arabidopsis organs and seedlings. The sequential dual‐PTM analysis expands our understanding of diurnal plant cell regulation by PTMs and provides a useful resource for future analyses, while emphasizing the importance of analyzing multiple PTMs simultaneously to elucidate when, where and how they are involved in plant cell regulation.
Background Cassava is an important food crop in tropical and sub-tropical regions worldwide. In Africa, cassava production is widely affected by cassava mosaic disease (CMD), which is caused by the African cassava mosaic geminivirus that is transmitted by whiteflies. Cassava breeders often use a single locus, CMD2, for introducing CMD resistance into susceptible cultivars. The CMD2 locus has been genetically mapped to a 10-Mbp region, but its organization and genes as well as their functions are unknown. Results We report haplotype-resolved de novo assemblies and annotations of the genomes for the African cassava cultivar TME (tropical Manihot esculenta), which is the origin of CMD2, and the CMD-susceptible cultivar 60444. The assemblies provide phased haplotype information for over 80% of the genomes. Haplotype comparison identified novel features previously hidden in collapsed and fragmented cassava genomes, including thousands of allelic variants, inter-haplotype diversity in coding regions, and patterns of diversification through allele-specific expression. Reconstruction of the CMD2 locus revealed a highly complex region with nearly identical gene sets but limited microsynteny between the two cultivars. Conclusions The genome maps of the CMD2 locus in both 60444 and TME3, together with the newly annotated genes, will help the identification of the causal genetic basis of CMD2 resistance to geminiviruses. Our de novo cassava genome assemblies will also facilitate genetic mapping approaches to narrow the large CMD2 region to a few candidate genes for better informed strategies to develop robust geminivirus resistance in susceptible cassava cultivars.
Background Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. Findings Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20–30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding. 
Conclusions The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity.
Phosphorus absorbed in the form of phosphate (H₂PO₄⁻) is an essential but limiting macronutrient for plant growth and agricultural productivity. A comprehensive understanding of how plants respond to phosphate starvation is essential for the development of more phosphate-efficient crops. Here we employed label-free proteomics and phosphoproteomics to quantify protein-level responses to 48 h of phosphate versus phosphite (H₂PO₃⁻) resupply to phosphate-deprived Arabidopsis thaliana suspension cells. Phosphite is similarly sensed, taken up and transported by plant cells as phosphate, but cannot be metabolized or used as a nutrient. Phosphite is thus a useful tool for differentiating between non-specific processes related to phosphate sensing and transport and specific responses to phosphorus nutrition. We found that responses to phosphate versus phosphite resupply occurred mainly at the level of protein phosphorylation, complemented by limited changes in protein abundance, primarily in protein translation, phosphate transport and scavenging, and central metabolism proteins. Altered phosphorylation of proteins involved in core processes such as translation, RNA splicing and kinase signaling was especially important. We also found differential phosphorylation in response to phosphate and phosphite in 69 proteins, including splicing factors, translation factors, the PHT1;4 phosphate transporter and the HAT1 histone acetyltransferase, potential phosphoswitches signaling changes in phosphorus nutrition. Our study illuminates several new aspects of the phosphate starvation response and identifies important targets for further investigation and potential crop improvement.