Nerpa: A Tool for Discovering Biosynthetic Gene Clusters of Bacterial Nonribosomal Peptides

Kunyavskaya, Olga; Tagirdzhanov, A. M.; Caraballo‐Rodríguez, Andrés Mauricio; Nothias, Louis Félix; Dorrestein, Pieter C.; Korobeynikov, Anton; Mohimani, Hosein; Gurevich, Alexey

doi:10.3390/metabo11100693

Cited by 15 publications

(17 citation statements)

References 56 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Although GARLIC had a higher specificity, in general, NRP to BGC matching quality obtained on the BioCAT results turned out to be higher than the GARLIC quality. The Nerpa tool [18] recently published and based on the same external software has shown similar general matching quality and as well as BioCAT overperformed GARLIC. However, it should be noted here that BioCAT was designed to be more sensitive than specific, unlike Nerpa which has shown high matching specificity but moderate sensitivity.…”

Section: Discussionmentioning

confidence: 97%

“…Additionally, the method was compared with the Nerpa tool published recently [18] . The same dataset of 984 genome/BGC pairs was analyzed with Nerpa.…”

Section: Methodsmentioning

confidence: 99%

“…In addition, we showed the applicability of BioCAT on several external data, including complete genomes as well as draft ones. Finally, we compared the BioCAT pipeline with the GARLIC tool [6] and Nerpa [18] which have a similar functionality.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

BioCAT: Search for biosynthetic gene clusters producing nonribosomal peptides with known structure

Konanov

Krivonos²,

Ilina³

et al. 2022

Computational and Structural Biotechnology Journal

View full text Add to dashboard Cite

Section: Discussionmentioning

confidence: 97%

“…Additionally, the method was compared with the Nerpa tool published recently [18] . The same dataset of 984 genome/BGC pairs was analyzed with Nerpa.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

BioCAT: Search for biosynthetic gene clusters producing nonribosomal peptides with known structure

Konanov

Krivonos²,

Ilina³

et al. 2022

Computational and Structural Biotechnology Journal

View full text Add to dashboard Cite

“…Natural products are the major source of lead compounds for drug development and represent the majority of small-molecule drugs that were already on the market [ 25 , 26 ]. However, due to the repeated discovery of known compounds, the hit rate of new skeletal compounds come down every year since the study of streptomyces metabolites reached its summit in the 1970s [ 27 ].…”

Section: Discussionmentioning

confidence: 99%

Carbon-nitrogen bond formation to construct novel polyketide-indole hybrids from the indole-3-carbinol exposed culture of Daldinia eschscholzii

Lin

Jiang

et al. 2022

Synthetic and Systems Biotechnology

View full text Add to dashboard Cite

“…Recently, some approaches and tools have been created to connect specialized metabolites (known and cryptic MS/MS spectra) to their biosynthetic gene clusters, such as Pattern-based Genome Mining (7, 8), MetaMiner (9), DeepRiPP (10), NRPquest (11), NRPminer (12), GNP (13) and NPLinker (14), recently reviewed by Van der Hooft et al ., 2020 (15). Nerpa (16) and GARLIC (17) can connect structures to BGCs; structures are normally represented in the SMILES (Simplified Molecular-Input Line-Entry System) format, a type of computer-readable annotation language for chemical structures. However, most of these tools are neither high throughput, nor efficient, or can only be used for a particular class of BGC (e.g., peptides or BGCs homologous to known BGCs).…”

Section: Introductionmentioning

confidence: 99%

NPOmix: a machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clusters

Leão

Wang

Silva

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Microbial natural products, in particular secondary or specialized metabolites, are an important source and inspiration for many pharmaceutical and biotechnological products. However, bioactivity-guided methods widely employed in natural product discovery programs do not explore the full biosynthetic potential of microorganisms, and they usually miss metabolites that are produced at low titer. As a complementary method, the use of genome-based mining in natural products research has facilitated the charting of many novel natural products in the form of predicted biosynthetic gene clusters that encode for their production. Linking the biosynthetic potential inferred from genomics to the specialized metabolome measured by metabolomics would accelerate natural product discovery programs. Here, we applied a supervised machine learning approach, the K-Nearest Neighbor (KNN) classifier, for systematically connecting metabolite mass spectrometry data to their biosynthetic gene clusters. This pipeline offers a method for annotating the biosynthetic genes for known, analogous to known and cryptic metabolites that are detected via mass spectrometry. We demonstrate this approach by automated linking of six different natural product mass spectra, and their analogs, to their corresponding biosynthetic genes. Our approach can be applied to bacterial, fungal, algal and plant systems where genomes are paired with corresponding MS/MS spectra. Additionally, an approach that connects known metabolites to their biosynthetic genes potentially allows for bulk production via heterologous expression and it is especially useful for cases where the metabolites are produced at low amounts in the original producer.

show abstract

Nerpa: A Tool for Discovering Biosynthetic Gene Clusters of Bacterial Nonribosomal Peptides

Cited by 15 publications

References 56 publications

BioCAT: Search for biosynthetic gene clusters producing nonribosomal peptides with known structure

BioCAT: Search for biosynthetic gene clusters producing nonribosomal peptides with known structure

Carbon-nitrogen bond formation to construct novel polyketide-indole hybrids from the indole-3-carbinol exposed culture of Daldinia eschscholzii

NPOmix: a machine learning classifier to connect mass spectrometry fragmentation data to biosynthetic gene clusters

Contact Info

Product

Resources

About