2012 6th IEEE INTERNATIONAL CONFERENCE INTELLIGENT SYSTEMS 2012
DOI: 10.1109/is.2012.6335145
|View full text |Cite
|
Sign up to set email alerts
|

An Integrative DTW-based imputation method for gene expression time series data

Abstract: Gene expression microarrays are the most commonly available source of high-throughput biological data. They are widely employed for studying many different aspects of gene regulation and function, ranging from understanding the global cell-cycle control of microorganisms to cancer in humans. Gene expression microarray experiments often generate data sets with multiple missing values. Many algorithms for gene expression data analysis require a complete data matrix and therefore, the accurate estimation of missi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2016
2016
2025
2025

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 20 publications
0
5
0
Order By: Relevance
“…The results showed that the combination of MICE and RF was more efficient than original methods for multivariate imputation. K-Nearest Neighbors ( -NN)-based imputation is also a popular method for completing missing values such as [11,26,27,[30][31][32]. This approach identifies most similar patterns in the space of available features to impute missing data.…”
Section: Classical Multivariate Imputation Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The results showed that the combination of MICE and RF was more efficient than original methods for multivariate imputation. K-Nearest Neighbors ( -NN)-based imputation is also a popular method for completing missing values such as [11,26,27,[30][31][32]. This approach identifies most similar patterns in the space of available features to impute missing data.…”
Section: Classical Multivariate Imputation Methodsmentioning
confidence: 99%
“…Step a and Step b for data (32) return position of -the most similar window to (33) end for (34) Replace the missing values at the position by average vector of the window after and the one previous (35) end for (36) end for (37) return -imputed time series This makes it possible to find out windows that have the most similar dynamics and shape to the queries. …”
Section: Fuzzy-weighted Similarity Measure Between Subsequencesmentioning
confidence: 99%
“…The abovementioned approaches are widely used in various fields, such as multimedia, healthcare, and finance [ 29 ]. These approaches have been applied in major research topics including earthquake prediction [ 30 ], terrestrial ecosystem dynamics [ 31 ], stock-price data, exchange-rate analysis [ 32 ], and bioinformatics [ 33 ]. However, they have not been specialized for binary time-series.…”
Section: Introductionmentioning
confidence: 99%
“…DTWcost is used as distance metric instead of pointwise distance measurements. Kostadinova et al [27] proposed an Integrative DTW-Based Imputation algorithm that is particularly suited for the estimation of missing values in gene expression time series data using multiple related information in datasets. This algorithm identifies an appropriate set of estimation matrices by using DTW-cost distance in order to measure similarities between gene expression matrices.…”
Section: Introductionmentioning
confidence: 99%