2015
DOI: 10.1371/journal.pone.0134256
|View full text |Cite
|
Sign up to set email alerts
|

Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry

Abstract: Mixture - modeling of mass spectra is an approach with many potential applications including peak detection and quantification, smoothing, de-noising, feature extraction and spectral signal compression. However, existing algorithms do not allow for automated analyses of whole spectra. Therefore, despite highlighting potential advantages of mixture modeling of mass spectra of peptide/protein mixtures and some preliminary results presented in several papers, the mixture modeling approach was so far not developed… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
3

Relationship

5
5

Authors

Journals

citations
Cited by 34 publications
(26 citation statements)
references
References 44 publications
0
26
0
Order By: Relevance
“…The MSI spectra dataset was preprocessed by performing mass channels unification, baseline subtraction [ 35 ], outlying spectra identification according to TIC (total ion current) value using criterion for skewed and heavy-tailed distributions [ 36 ], fast Fourier transform-based peak alignment to reference average spectrum [ 37 ], and TIC normalization. Gaussian mixture modeling (GMM) of the average spectrum was applied for peak detection as described in detail elsewhere [ 38 , 39 ]. GMM components of low amplitude and high variance were removed from initial GMM spectra representation; GMM components modeling the same spectrum peak were merged by summing their estimated abundance and setting the location of a dominant component as mass/charge value of a peptide ion.…”
Section: Methodsmentioning
confidence: 99%
“…The MSI spectra dataset was preprocessed by performing mass channels unification, baseline subtraction [ 35 ], outlying spectra identification according to TIC (total ion current) value using criterion for skewed and heavy-tailed distributions [ 36 ], fast Fourier transform-based peak alignment to reference average spectrum [ 37 ], and TIC normalization. Gaussian mixture modeling (GMM) of the average spectrum was applied for peak detection as described in detail elsewhere [ 38 , 39 ]. GMM components of low amplitude and high variance were removed from initial GMM spectra representation; GMM components modeling the same spectrum peak were merged by summing their estimated abundance and setting the location of a dominant component as mass/charge value of a peptide ion.…”
Section: Methodsmentioning
confidence: 99%
“…The basic preprocessing steps included: spectrum resampling, adaptive baseline correction (Bednarczyk et al 2017 ), identification of the outlying spectra (as those with too big or too small TIC) with the use of Bruffaerts’ criterion for extremely skewed distributions (Bruffaerts et al 2014 ), spectra alignment to the average spectrum based on Fast Fourier Transformation (Wong et al 2005 ), and TIC normalization. Gaussian mixture model (GMM) approach (Polanski et al 2015 ) was used for the average spectrum modeling and peak detection. GMM components of high variance and/or low amplitude were filtered out reducing the data dimensionality.…”
Section: Methodsmentioning
confidence: 99%
“…Standard spectrum preprocessing sequenced steps were applied as follows: (i) resampling to common mass channels, (ii) adaptive baseline detection and correction [24], (iii) outlying spectra identification according to TIC value using Bruffaerts' criterion [25], (iv) fast Fourier transform-based spectral alignment [26], and (v) TIC normalization. The Gaussian mixture model (GMM) approach described in detail elsewhere [27,28] was used for the average spectrum modeling and peak detection. Peptide abundance was estimated by pairwise convolution of the GMM components and individual spectra.…”
Section: Spectra Processing and Identification Of Spectral Componentsmentioning
confidence: 99%