Linaridins are members of the ribosomally synthesized and post-translationally modified peptide (RiPP) family of natural products. Five linaridins have been reported, which are defined by the presence of dehydrobutyrine, a dehydrated threonine residue. This work describes the development of a linaridinspecific scoring module for Rapid ORF Description and Evaluation Online (RODEO), a genome-mining tool tailored towards RiPP discovery. Upon mining publicly accessible genomes available in the NCBI database, RODEO identified 561 (382 non-redundant) linaridin biosynthetic gene clusters (BGCs). Linaridin BGCs with unique gene architectures and precursor sequences markedly different from previous predictions were uncovered during these efforts. To aid in dataset validation, two new linaridins, pegvadin A and B, were detected through reactivity-based screening (RBS) and isolated from Streptomyces noursei and Streptomyces auratus, respectively. RBS involves the use of a reactive chemical probe that chemoselectively modifies a functional group present in the natural product. The dehydrated amino acids present in linaridins as a/b-unsaturated carbonyls were appropriate electrophiles for nucleophilic 1,4addition using a thiol-functionalized probe. The data presented within significantly expands the number of predicted linaridin BGCs and serves as a road map for future work in the area. The combination of bioinformatics and RBS is a powerful approach to accelerate natural product discovery.
The era of inexpensive genome sequencing and improved bioinformatics tools has reenergized the study of natural products, including the ribosomally synthesized and post-translationally modified peptides (RiPPs). In recent years, RiPP discovery has challenged preconceptions about the scope of post-translational modification chemistry, but genome mining of new RiPP classes remains an unsolved challenge. Here, we report a RiPP class defined by an unusual (S)-N2,N2-dimethyl-1,2-propanediamine (Dmp)-modified C-terminus, which we term the daptides. Nearly 500 daptide biosynthetic gene clusters (BGCs) were identified by analyzing the RiPP Recognition Element (RRE), a common substrate-binding domain found in half of prokaryotic RiPP classes. A representative daptide BGC from Microbacterium paraoxydans DSM 15019 was selected for experimental characterization. Derived from a C-terminal threonine residue, the class-defining Dmp is installed over three steps by an oxidative decarboxylase, aminotransferase, and methyltransferase. Daptides uniquely harbor two positively charged termini, and thus we suspect this modification could aid in membrane targeting, as corroborated by hemolysis assays. Our studies further show that the oxidative decarboxylation step requires a functionally unannotated accessory protein. Fused to the C-terminus of the accessory protein is an RRE domain, which delivers the unmodified substrate peptide to the oxidative decarboxylase. This discovery of a class-defining post-translational modification in RiPPs may serve as a prototype for unveiling additional RiPP classes through genome mining.
The era of inexpensive genome sequencing and improved bioinformatics tools has reenergized the study of natural products, including the ribosomally synthesized and post-translationally modified peptides (RiPPs). In recent years, RiPP discovery has challenged preconceptions about the scope of post-translational modification chemistry, but genome mining of new RiPP classes remains an unsolved challenge. Here, we report a RiPP class defined by an unusual (S)-N2,N2-dimethyl-1,2-propanediamine (Dmp)-modified C-terminus, which we term the daptides. Nearly 500 daptide biosynthetic gene clusters (BGCs) were identified by analyzing the RiPP Recognition Element (RRE), a common substrate-binding domain found in half of prokaryotic RiPP classes. A representative daptide BGC from Microbacterium paraoxydans DSM 15019 was selected for experimental characterization. Derived from a C-terminal threonine residue, the class-defining Dmp is installed over three steps by an oxidative decarboxylase, aminotransferase, and methyltransferase. Daptides uniquely harbor two positively charged termini, and thus we suspect this modification could aid in membrane targeting, as corroborated by hemolysis assays. Our studies further show that the oxidative decarboxylation step requires a functionally unannotated accessory protein. Fused to the C-terminus of the accessory protein is an RRE domain, which delivers the unmodified substrate peptide to the oxidative decarboxylase. This discovery of a class-defining post-translational modification in RiPPs may serve as a prototype for unveiling additional RiPP classes through genome mining.
The radical S-adenosylmethionine (rSAM) superfamily has become a wellspring for discovering new enzyme chemistry, especially regarding ribosomally synthesized and post-translationally modified peptides (RiPPs). Here, we report a compendium of nearly 15,000 rSAM proteins with high-confidence involvement in RiPP biosynthesis. While recent bioinformatics advances have unveiled the broad sequence space covered by rSAM proteins, the significant challenge of functional annotation remains unsolved. Through a combination of sequence analysis and protein structural predictions, we identified a set of catalytic site proximity residues with functional predictive power, especially among the diverse rSAM proteins that form sulfur-to-α carbon thioether (sactionine) linkages. As a case study, we report that an rSAM protein from Streptomyces sparsogenes (StsB) shares higher full-length similarity with MftC (mycofactocin biosynthesis) than any other characterized enzyme. However, a comparative analysis of StsB to known rSAM proteins using "catalytic site proximity" predicted that StsB would be distinct from MftC and instead form sactionine bonds. The prediction was confirmed by mass spectrometry, targeted mutagenesis, and chemical degradation. We further used "catalytic site proximity" analysis to identify six new sactipeptide groups undetectable by traditional genome-mining strategies. Additional catalytic site proximity profiling of cyclophane-forming rSAM proteins suggests that this approach will be more broadly applicable and enhance, if not outright correct, protein functional predictions based on traditional genomic enzymology principles.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.