With an ever-increasing amount of (meta)genomic data being deposited in sequence databases, (meta)genome mining for natural product biosynthetic pathways occupies a critical role in the discovery of novel pharmaceutical drugs, crop protection agents and biomaterials. The genes that encode these pathways are often organised into biosynthetic gene clusters (BGCs). In 2015, we defined the Minimum Information about a Biosynthetic Gene cluster (MIBiG): a standardised data format that describes the minimally required information to uniquely characterise a BGC. We simultaneously constructed an accompanying online database of BGCs, which has since been widely used by the community as a reference dataset for BGCs and was expanded to 2021 entries in 2019 (MIBiG 2.0). Here, we describe MIBiG 3.0, a database update comprising large-scale validation and re-annotation of existing entries and 661 new entries. Particular attention was paid to the annotation of compound structures and biological activities, as well as protein domain selectivities. Together, these new features keep the database up-to-date, and will provide new opportunities for the scientific community to use its freely available data, e.g. for the training of new machine learning models to predict sequence-structure-function relationships for diverse natural products. MIBiG 3.0 is accessible online at https://mibig.secondarymetabolites.org/.
Few tools exist in
natural products discovery to integrate biological
screening and untargeted mass spectrometry data at the library scale.
Previously, we reported Compound Activity Mapping as a strategy for
predicting compound bioactivity profiles directly from primary screening
results on extract libraries. We now present NP Analyst, an open online
platform for Compound Activity Mapping that accepts bioassay data
of almost any type, and is compatible with mass spectrometry data
from major instrument manufacturers via the mzML format. In addition,
NP Analyst will accept processed mass spectrometry data from the MZmine
2 and GNPS open-source platforms, making it a versatile tool for integration
with existing discovery workflows. We demonstrate the utility of this
new tool for both the dereplication of known compounds and the discovery
of novel bioactive natural products using a challenging low-resolution
antimicrobial bioassay data set. This new platform is available at
.
Exploration for specialized metabolites of Okinawan marine sponges Agelas spp. resulted in the isolation of five new bromopyrrole alkaloids, agesasines A (1) and B (2), 9-hydroxydihydrodispacamide (3), 9-hydroxydihydrooroidin (4), and 9E-keramadine (5). Their structures were elucidated on the basis of spectroscopic analyses. Agesasines A (1) and B (2) were assigned as rare bromopyrrole alkaloids lacking an aminoimidazole moiety, while 3–5 were elucidated to be linear bromopyrrole alkaloids with either aminoimidazolone, aminoimidazole, or N-methylated aminoimidazole moieties.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.