2019
DOI: 10.1021/acs.analchem.9b04811
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning for the Precise Peak Detection in High-Resolution LC–MS Data

Abstract: This letter is devoted to the application of machine learning, namely, convolutional neural networks to solve problems in the initial steps of the common pipeline for data analysis in metabolomics. These steps are the peak detection and the peak integration in raw liquid chromatography–mass spectrometry (LC–MS) data. Widely used algorithms suffer from rather poor precision for these tasks, yielding many false positive signals. In the present work, we developed an algorithm named peakonly, which has high flexib… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
82
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 131 publications
(84 citation statements)
references
References 19 publications
1
82
1
Order By: Relevance
“…Numerous free and commercial software are available for MS-based data processing such as MZmine [47] , XCMS [45] , metaMS [48] , and metAlign [49] , to name a few. However, key challenges, such as false positive signals, co-eluting compounds and non-linear retention time shift, still need to be addressed [50] , [51] , [52] . With the complexity of MS data, DL approaches have been proposed to solve this key data pre-processing step and major bottleneck of MS-based metabolomics pipelines.…”
Section: In Ms Spectra Processing and Interpretationmentioning
confidence: 99%
See 1 more Smart Citation
“…Numerous free and commercial software are available for MS-based data processing such as MZmine [47] , XCMS [45] , metaMS [48] , and metAlign [49] , to name a few. However, key challenges, such as false positive signals, co-eluting compounds and non-linear retention time shift, still need to be addressed [50] , [51] , [52] . With the complexity of MS data, DL approaches have been proposed to solve this key data pre-processing step and major bottleneck of MS-based metabolomics pipelines.…”
Section: In Ms Spectra Processing and Interpretationmentioning
confidence: 99%
“…These profiles were initially modelled by PARAllel FACtor analysis2 (PARAFAC2) [50] , [54] and subsequently delineated into chemical peaks (metabolite), baselines and other non-related peak areas by the CNN model, which resolved which peak component were most suitable for selection or integration. Similarly, Melnikov et al proposed ‘peakonly’ algorithm [51] for both peak detection and integration that used a CNN model to classify raw LC-MS data into regions of noise, chemical peaks, and uncertain peaks, which was then used to determine peak boundaries for integration. Automated and high accuracy peak classifiers would greatly improve efficiency in these critical steps, which often heavily rely on domain experts.…”
Section: In Ms Spectra Processing and Interpretationmentioning
confidence: 99%
“…Numerous algorithms exist for the peak detection of LC-MS data [ 85 , 130 , 131 , 132 , 133 , 134 ]. Most of them first convert profile spectra to line spectra, if necessary, and then perform spectra smoothing.…”
Section: From Non-targeted Data Sets To Marker Compoundsmentioning
confidence: 99%
“…A few packages have been developed to deal with feature degeneracy: CAMERA 16 is based on adduct relationships; RAMClust 17 correlates features in multiple samples; MS-FLO 18 uses Pearson’s correlation and peak height similarity to identify adducts, duplicate peaks and isotope features of the main monoisotopic ion, and MZunity 19 which confronts adducts or neutral loss index to decipher relationship among the acquired high resolution pseudo-molecular ions list. Deep-learning approaches have also been developed based on LC-MS spectral peak shape filtering 20,21 . All these packages focus on a single type of degeneracy, however, and they are difficult to implement in a unified workflow.…”
Section: Figurementioning
confidence: 99%