2004
DOI: 10.1093/bioinformatics/bth947
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Quality Assessment of Peptide Tandem Mass Spectra

Abstract: We report on two different approaches to assessing spectral quality prior to identification: binary classification, which predicts whether or not SEQUEST will be able to make an identification, and statistical regression, which predicts a more universal quality metric involving the number of b- and y-ion peaks. The best of our binary classifiers can eliminate over 75% of the unidentifiable spectra while losing only 10% of the identifiable spectra. Statistical regression can pick out spectra of modified peptide… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
196
0

Year Published

2004
2004
2017
2017

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 186 publications
(198 citation statements)
references
References 18 publications
2
196
0
Order By: Relevance
“…Tandem mass spectra were extracted into ms1 and ms2 files (14) from raw files using RawExtract 1.9.9. Poor quality spectra were removed from the dataset using an automated spectral quality assessment algorithm (17). MS/MS spectra remaining after filtering were searched with the ProLuCID algorithm against the EBI-IPI_Mouse_01_01_04-19-2009 concatenated to a decoy data base in which the sequence for each entry in the original data base was reversed (18).…”
Section: Fractionation Of Brain Tissue and Affinity Purifications-mentioning
confidence: 99%
“…Tandem mass spectra were extracted into ms1 and ms2 files (14) from raw files using RawExtract 1.9.9. Poor quality spectra were removed from the dataset using an automated spectral quality assessment algorithm (17). MS/MS spectra remaining after filtering were searched with the ProLuCID algorithm against the EBI-IPI_Mouse_01_01_04-19-2009 concatenated to a decoy data base in which the sequence for each entry in the original data base was reversed (18).…”
Section: Fractionation Of Brain Tissue and Affinity Purifications-mentioning
confidence: 99%
“…Methods have been devised to sort the good from the bad spectra 34,37,38 . Recently, Bern et al presented two different algorithms for assessing spectral quality prior to a database search: a binary classifier, which predicts whether or not the search engine will be able to make an identification, and a statistical regression, which predicts a more universal quality metric, independent of the database search program 39 . A quadratic discriminant analysis, a classical machine learning algorithm, was trained on a data set of manually validated good and bad spectra.…”
Section: Reviewmentioning
confidence: 99%
“…To limit the number of spectra and ensure an enriched set of unique spectra, poor-quality spectra could be removed and duplicates identified and eliminated 39,60,61 . Spectra should be searched with at least two algorithms to take advantage of the different selectivities of algorithms (for example, SEQUEST and Mascot).…”
Section: Box 3 Strategy For Large Scale Data Analysismentioning
confidence: 99%
“…Large MS/MS datasets contain many low-quality spectra that cannot result in reliable peptide identifications [28,29]. Typically, when a whole MS/MS dataset is searched, only a small fraction of the spectra (less than 20%) get identified.…”
Section: Filtering Ms/ms Datasetsmentioning
confidence: 99%