2023
DOI: 10.1186/s40168-022-01444-3
|View full text |Cite
|
Sign up to set email alerts
|

Enhanced correlation-based linking of biosynthetic gene clusters to their metabolic products through chemical class matching

Abstract: Background It is well-known that the microbiome produces a myriad of specialised metabolites with diverse functions. To better characterise their structures and identify their producers in complex samples, integrative genome and metabolome mining is becoming increasingly popular. Metabologenomic co-occurrence-based correlation scoring methods facilitate the linking of metabolite mass fragmentation spectra (MS/MS) to their cognate biosynthetic gene clusters (BGCs) based on shared absence/presenc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
16
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(16 citation statements)
references
References 32 publications
0
16
0
Order By: Relevance
“…Combining such solutions with the integration of more diverse species, new annotations, and improved correlation scoring methods like the one developed in Hjörleifsson Eldjárn et al [ 35 ] would improve such analyses drastically. Furthermore, we expect that combining co-occurrence based scores (such as the standardised Metcalf one) with feature-based scores, such as NPClassScore [ 36 ], and the here developed iPRESTO, will further help to prioritise plausible BGC-MS/MS spectral links [ 12 , 13 ]. Indeed, we expect that tools like iPRESTO could in the future be built into frameworks like NPLinker [ 35 ].…”
Section: Resultsmentioning
confidence: 99%
“…Combining such solutions with the integration of more diverse species, new annotations, and improved correlation scoring methods like the one developed in Hjörleifsson Eldjárn et al [ 35 ] would improve such analyses drastically. Furthermore, we expect that combining co-occurrence based scores (such as the standardised Metcalf one) with feature-based scores, such as NPClassScore [ 36 ], and the here developed iPRESTO, will further help to prioritise plausible BGC-MS/MS spectral links [ 12 , 13 ]. Indeed, we expect that tools like iPRESTO could in the future be built into frameworks like NPLinker [ 35 ].…”
Section: Resultsmentioning
confidence: 99%
“…3B , score with co-occurrence threshold) because this very high value was obtained using only the similarity and the biosynthetic class features and three nearest neighbors, a good number of candidates for genome mining. Moreover, the use of just the similarity and biosynthetic class is very appealing because these kinds of features can be well predicted for metabolites with known or unknown metabolites, for example, by using GNPS for similarity and CANOPUS and/or MolNetEnhancer for biosynthetic class predictions, as demonstrated by NPClassScore ( 39 ). It is much more challenging to predict substructures for unknown/cryptic metabolites, which is a topic of ongoing research by our group and others.…”
Section: Resultsmentioning
confidence: 99%
“…The latter is mainly due to the fact that NPOmix selects which metabolites (MS/MS spectra) should be targeted for classification in opposition to NPLinker that attempts to link all metabolites present in the analyzed metabolomes; (iii) NPOmix is a tool for connecting metabolites to BGCs, NPLinker is a framework that facilitates integrative omics analysis, and in the future, NPOmix will likely become part of the NPLinker framework. We stress that “benchmarking NPLinker” effectively means benchmarking the (standardized) Metcalf strain-correlation and Rosetta scoring systems; in the future, it is likely a combination of various strain-correlation-based and feature-based scoring systems (such as NPClassScore) ( 39 ) that will enable the most effective integrative omics mining. Both precisions for top-1, and top-1 shared ranks are provided in Fig.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations