These authors contributed equally to the manuscript. SUMMARYThe pentatricopeptide repeat (PPR) proteins form one of the largest protein families in land plants. They are characterised by tandem 30-40 amino acid motifs that form an extended binding surface capable of sequence-specific recognition of RNA strands. Almost all of them are post-translationally targeted to plastids and mitochondria, where they play important roles in post-transcriptional processes including splicing, RNA editing and the initiation of translation. A code describing how PPR proteins recognise their RNA targets promises to accelerate research on these proteins, but making use of this code requires accurate definition and annotation of all of the various nucleotide-binding motifs in each protein. We have used a structural modelling approach to define 10 different variants of the PPR motif found in plant proteins, in addition to the putative deaminase motif that is found at the C-terminus of many RNA-editing factors. We show that the super-helical RNA-binding surface of RNA-editing factors is potentially longer than previously recognised. We used the redefined motifs to develop accurate and consistent annotations of PPR sequences from 109 genomes. We report a high error rate in PPR gene models in many public plant proteomes, due to gene fusions and insertions of spurious introns. These consistently annotated datasets across a wide range of species are valuable resources for future comparative genomics studies, and an essential pre-requisite for accurate large-scale computational predictions of PPR targets. We have created a web portal (http://www.-plantppr.com) that provides open access to these resources for the community.
The de novo evolution of genes and the novel proteins they encode has stimulated much interest in the contribution such innovations make to the diversity of life. Most research on this de novo evolution focuses on transcripts, so studies on the biochemical steps that can enable completely new proteins to evolve and the time required to do so have been lacking. Sunflower Preproalbumin with SFTI-1 (PawS1) is an unusual albumin precursor because in addition to producing albumin it also yields a potent, bicyclic protease-inhibitor called SunFlower Trypsin Inhibitor-1 (SFTI-1). Here, we show how this inhibitor peptide evolved stepwise over tens of millions of years. To trace the origin of the inhibitor peptide SFTI-1, we assembled seed transcriptomes for 110 sunflower relatives whose evolution could be resolved by a chronogram, which allowed dates to be estimated for the various stages of molecular evolution. A genetic insertion event in an albumin precursor gene ∼45 Ma introduced two additional cleavage sites for protein maturation and conferred duality upon PawS1-Like genes such that they also encode a small buried macrocycle. Expansion of this region, including two Cys residues, enlarged the peptide ∼34 Ma and made the buried peptides bicyclic. Functional specialization into a protease inhibitor occurred ∼23 Ma. These findings document the evolution of a novel peptide inside a benign region of a pre-existing protein. We illustrate how a novel peptide can evolve without de novo gene evolution and, critically, without affecting the function of what becomes the protein host.
Plant asparaginyl endopeptidases (AEPs) are expressed as inactive zymogens that perform maturation of seed storage protein upon cleavage-dependent autoactivation in the low-pH environment of storage vacuoles. The AEPs have attracted attention for their macrocyclization reactions, and have been classified as cleavage or ligation specialists. However, we have recently shown that the ability of AEPs to produce either cyclic or acyclic products can be altered by mutations to the active site region, and that several AEPs are capable of macrocyclization given favorable pH conditions. One AEP extracted from Clitoria ternatea seeds (butelase 1) is classified as a ligase rather than a protease, presenting an opportunity to test for loss of cleavage activity. Here, making recombinant butelase 1 and rescuing an Arabidopsis thaliana mutant lacking AEP, we show that butelase 1 retains cleavage functions in vitro and in vivo. The in vivo rescue was incomplete, consistent with some trade-off for butelase 1 specialization toward macrocyclization. Its crystal structure showed an active site with only subtle differences from cleaving AEPs, suggesting the many differences in its peptide-binding region are the source of its efficient macrocyclization. All considered, it seems that either butelase 1 has not fully specialized or a requirement for autocatalytic cleavage is an evolutionary constraint upon macrocyclizing AEPs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.