2022
DOI: 10.1002/pmic.202200092
|View full text |Cite
|
Sign up to set email alerts
|

Dealing with missing values in proteomics data

Abstract: Proteomics data are often plagued with missingness issues. These missing values (MVs) threaten the integrity of subsequent statistical analyses by reduction of statistical power, introduction of bias, and failure to represent the true sample. Over the years, several categories of missing value imputation (MVI) methods have been developed and adapted for proteomics data. These MVI methods perform their tasks based on different prior assumptions (e.g., data is normally or independently distributed) and operating… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
42
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 38 publications
(42 citation statements)
references
References 132 publications
(226 reference statements)
0
42
0
Order By: Relevance
“…Protein quantification is then achieved by summarizing intensities of constituent peptides. Despite advances in many aspects of MS-based proteomics, both instrumental and methodological, missing values in the peptide intensity quantifications remain a common feature of such datasets and present one of most challenging problems for downstream analysis ( Webb-Robertson et al 2015 ; Kong et al 2022 ).…”
Section: Introductionmentioning
confidence: 99%
“…Protein quantification is then achieved by summarizing intensities of constituent peptides. Despite advances in many aspects of MS-based proteomics, both instrumental and methodological, missing values in the peptide intensity quantifications remain a common feature of such datasets and present one of most challenging problems for downstream analysis ( Webb-Robertson et al 2015 ; Kong et al 2022 ).…”
Section: Introductionmentioning
confidence: 99%
“…The hurdles associated with missing values are a recurring issue in data analysis and concern a wide range of fields and applications . The handling of missing values in mass spectrometry (MS)-based proteomics is still actively debated. While elucidating the best computational approaches to manage missing values is ongoing, the field continues pushing the boundaries of low input acquisitions. Recent technical advances in MS have paved the way for MS-based single-cell proteomics (SCP), but handling missing values in SCP data is still a clear challenge to principled data analysis .…”
Section: Introductionmentioning
confidence: 99%
“…In some cases, missingness is meaningful and should not be imputed, while in other cases it may be due to limited instrument sensitivity or algorithmic/statistical issues. The mechanism behind MVs is important in determining the appropriate handling procedure 6 . MVs can be broadly categorized into three types: Missing not at random (MNAR), missing at random (MAR), and missing completely at random (MCAR) 7 .…”
Section: Introductionmentioning
confidence: 99%
“…These methods can range from simple mean or median imputation to more sophisticated techniques such as multiple imputation, hot deck imputation, expectation-maximization algorithm, and various machine learning models such as k-nearest neighbors, Random Forest, and neural networks. The choice of the MVI technique will depend on the characteristics of the data, the proportion of missing values, the type of missingness and the downstream analysis(6).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation