Genomics and metabolomics are widely used to explore specialized metabolite diversity. The Paired Omics Data Platform is a community initiative to systematically document links between metabolome and (meta)genome data, aiding identification of natural product biosynthetic origins and metabolite structures.
Using automated genome analysis tools, it is often unclear to what degree genetic variability in homologous biosynthetic pathways relates to structural variation. This hampers strain prioritization and compound identification and can lead to overinterpretation of chemical diversity. Here, we assessed the metabolic potential of Nocardia, an underinvestigated actinobacterial genus that is known to comprise opportunistic human pathogens. Our analysis revealed a plethora of putative biosynthetic gene clusters of various classes, including polyketide, nonribosomal peptide, and terpenoid pathways. Furthermore, we used the highly conserved biosynthetic pathway for nocobactin-like siderophores to investigate how gene cluster differences correlate to structural differences in the produced compounds. Sequence similarity networks generated by BiG-SCAPE (Biosynthetic Gene Similarity Clustering and Prospecting Engine) showed the presence of several distinct gene cluster families. Metabolic profiling of selected Nocardia strains using liquid chromatography-mass spectrometry (LC-MS) metabolomics data, nuclear magnetic resonance (NMR) spectroscopy, and GNPS (Global Natural Product Social molecular networking) revealed that nocobactin-like biosynthetic gene cluster (BGC) families above a BiG-SCAPE threshold of 70% can be assigned to distinct structural types of nocobactin-like siderophores.
IMPORTANCE Our work emphasizes that Nocardia represent a prolific source for natural products rivaling better-characterized genera such as Streptomyces or Amycolatopsis. Furthermore, we showed that large-scale analysis of biosynthetic gene clusters using similarity networks with high stringency allows the distinction and prediction of natural product structural variations. This will facilitate future genomics-driven drug discovery campaigns.
We report a genomics‐guided exploration of the metabolic potential of the brasilicardin producer strain
Nocardia terpenica
IFM 0406. Bioinformatics analysis of the whole genome sequence revealed the presence of a biosynthetic gene cluster presumably responsible for the generation of formerly unknown nocobactin derivatives. Mass spectrometry‐assisted isolation led to the identification of three new siderophores, terpenibactins A (
1
), B (
2
) and C (
3
), which belong to the class of nocobactins. Their structures were elucidated by employing spectroscopic techniques. Compounds
1
–
3
demonstrated inhibitory activity towards the muscarinic M3 receptor, while exhibiting only a low cytotoxicity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.