2009
DOI: 10.2298/csis0902165h
|View full text |Cite
|
Sign up to set email alerts
|

Microarray missing values imputation methods: Critical analysis review

Abstract: Gene expression data often contain missing expression values. For the purpose of conducting an effective clustering analysis and since many algorithms for gene expression data analysis require a complete matrix of gene array values, choosing the most effective missing value estimation method is necessary. In this paper, the most commonly used imputation methods from literature are critically reviewed and analyzed to explain the proper use, weakness and point the observations on each published method. From the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…Note also that C ⊂ C. Indeed, it is easy to see that σ 1 is the largest eigenvalue of Σ with multiplicity 1 and associated eigenvector θ with less than s 1 nonzero components, I p − θθ ⊤ ∞ = 1 and (I p − θθ ⊤ )θ = 0. Next, we define ω 0 = (1, 1, 0, • • • , 0) ∈ {0, 1} p and Ω = ω = (ω (1) , • • • , ω (p) ) ∈ {0, 1} p : ω (1) = ω (2) = 1, |ω| 0 = s 1 ∪ {ω 0 }.…”
Section: Proof Of Theoremmentioning
confidence: 99%
See 1 more Smart Citation
“…Note also that C ⊂ C. Indeed, it is easy to see that σ 1 is the largest eigenvalue of Σ with multiplicity 1 and associated eigenvector θ with less than s 1 nonzero components, I p − θθ ⊤ ∞ = 1 and (I p − θθ ⊤ )θ = 0. Next, we define ω 0 = (1, 1, 0, • • • , 0) ∈ {0, 1} p and Ω = ω = (ω (1) , • • • , ω (p) ) ∈ {0, 1} p : ω (1) = ω (2) = 1, |ω| 0 = s 1 ∪ {ω 0 }.…”
Section: Proof Of Theoremmentioning
confidence: 99%
“…The simple strategy that consists in eliminating from the PCA study any gene with at least one missing observation is not acceptable in this context since up to 95% of the genes can be eliminated from the study. An alternative strategy consists in infering the missing values prior to the PCA using complex imputation schemes [2,5]. These schemes usually assume that the genes interactions follow some specified model and involve intensive computational preprocessing to imput the missing observations.…”
Section: Introductionmentioning
confidence: 99%
“…Imputation strategies, such as substituting row averages for missing values, are necessary. Additionally, SVD’s performance is sensitive to data type, exhibiting potential challenges in non-time series datasets lacking clear expression patterns (59). Its linear regression nature in lower-dimensional space may result in diminished performance for non-time series data, where expression patterns are often less distinct (57).…”
Section: Complex Imputation Methodsmentioning
confidence: 99%
“…Even though this is a low-cost solution, it might produce lowquality datasets. The third category, imputation, is one of the best methods that can renew the whole dataset in order to prove the best means to process the missing values in the experiments (Tian et al, 2012;Hourani & Emary, 2009). Accordingly, the imputation method attempts to increase the relevancy and knowledge from the data that are able to construct a complete dataset.…”
Section: Introductionmentioning
confidence: 99%