2016
DOI: 10.1002/pmic.201500431
|View full text |Cite
|
Sign up to set email alerts
|

How to talk about protein‐level false discovery rates in shotgun proteomics

Abstract: A frequently sought output from a shotgun proteomics experiment is a list of proteins that we believe to have been present in the analyzed sample before proteolytic digestion. The standard technique to control for errors in such lists is to enforce a preset threshold for the false discovery rate (FDR). Many consider protein‐level FDRs a difficult and vague concept, as the measurement entities, spectra, are manifestations of peptides and not proteins. Here, we argue that this confusion is unnecessary and provid… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
62
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 51 publications
(62 citation statements)
references
References 33 publications
0
62
0
Order By: Relevance
“…stemming from the set of 1000 non-present PrESTs or from the set of the mixtures not used in the sample) as being incorrectly matched. As the correct matches only map to present proteins, whereas incorrect matches distribute over both present and absent proteins, we also normalized the Entrapment FDR by the prior probability of the PrEST to be absent, the so-called π A [4].…”
Section: Data Processingmentioning
confidence: 99%
See 2 more Smart Citations
“…stemming from the set of 1000 non-present PrESTs or from the set of the mixtures not used in the sample) as being incorrectly matched. As the correct matches only map to present proteins, whereas incorrect matches distribute over both present and absent proteins, we also normalized the Entrapment FDR by the prior probability of the PrEST to be absent, the so-called π A [4].…”
Section: Data Processingmentioning
confidence: 99%
“…Currently, there are two available methods to determine the accuracy of inference procedures and their error estimates: (i) simulations of proteomics experiments and (ii) analysis of experiments on samples with known protein content. By simulating the proteolytic digestion and the subsequent matching of mass spectra to peptides [2,3,4] one can obtain direct insights into how well the simulated absence or presence of a protein is reflected by a protein inference procedure. However, there is always the risk that the assumptions of the simulations are diverging from the complex nature of a mass spectrometry experiment.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The need for an integrated model becomes clear once one considers the most natural hypothesis [33] for protein quantification: one strives to estimate the combined probability that a particular protein is (i) correctly identified, (ii) correctly quantified and (iii) present in a different quantity between treatment groups. The separate probabilities of (i), (ii) and (iii) are less interesting individually and worse, one is easily lulled into a false sense of reliability by claims of control of the FDR in individual steps.…”
Section: Introductionmentioning
confidence: 99%
“…Firstly, proteins are selected for further analysis based on identification FDR. However, the identification FDR is an estimate of the evidence for the presence of proteins [33], and not a measure of how quantifiable they are i.e. their peptides being detected across conditions and being in the quantifiable range.…”
Section: Introductionmentioning
confidence: 99%