2021
DOI: 10.1101/2021.04.18.440324
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MS2DeepScore - a novel deep learning similarity measure for mass fragmentation spectrum comparisons

Abstract: Mass spectrometry data is one of the key sources of information in many workflows in medicine and across the life sciences. Mass fragmentation spectra are considered characteristic signatures of the chemical compound they originate from, yet the chemical structure itself usually cannot be easily deduced from the spectrum. Often, spectral similarity measures are used as a proxy for structural similarity but this approach is strongly limited by a generally poor correlation between both metrics. Here, we propose … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…To evaluate the extrapolation ability of CReSS, we assessed the dependence of identification accuracy on structural similarity to the training set. The Tanimoto coefficient based on ECFP4 fingerprints was employed as a measure of molecular similarity . As shown in Figure S7, molecules with a relatively high similarity to molecules in the training set had a higher recall@10 than molecules with relatively low similarity to molecules in the training set.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To evaluate the extrapolation ability of CReSS, we assessed the dependence of identification accuracy on structural similarity to the training set. The Tanimoto coefficient based on ECFP4 fingerprints was employed as a measure of molecular similarity . As shown in Figure S7, molecules with a relatively high similarity to molecules in the training set had a higher recall@10 than molecules with relatively low similarity to molecules in the training set.…”
Section: Resultsmentioning
confidence: 99%
“…The Tanimoto coefficient based on ECFP4 fingerprints was employed as a measure of molecular similarity. 37 As shown in Figure S7, molecules with a relatively high similarity to molecules in the training set had a higher recall@10 than molecules with relatively low similarity to molecules in the training set. The results indicated that the similarity level of molecules in the external test set to those in the training set had a significant impact on the CReSS identification performance.…”
Section: ■ Experimental Sectionmentioning
confidence: 92%
“…In fact, the first example of a supervised machine learning-based approach was just proposed. 46 These developments may further assist in the biochemical interpretation of such mass spectral networks thereby facilitating the process of turning large-scale untargeted mass spectral analyses into biochemical knowledge.…”
Section: Network-based Ms-based Metabolomics Toolsmentioning
confidence: 99%
“…Additionally, Spec2Vec will enhance interpretation as it facilitates the correct assignment of metabolites, whose chemical class can be annotated and used to supplement metabolite annotation [237]. Despite its remarkable performance, Spec2Vec was not trained for the task of returning a higher similarity score for spectral pairs of structurally more closely related metabolites; therefore, very recently, MS2DeepScore was introduced that uses a Siamese neural network to predict the structural similarity between two chemical structures solely based on their MS/MS fragmentation spectra [238]. MS2DeepScore outperforms classical spectral similarity measures as well as Spec2Vec in retrieving chemically related compound pairs from large mass spectral datasets, thereby illustrating its potential for spectral library matching.…”
Section: Spectral Similarity Scoring For Library Matching and Correlation Of Spectramentioning
confidence: 99%