2019
DOI: 10.1186/s12859-019-2832-3
|View full text |Cite
|
Sign up to set email alerts
|

Optimal clustering with missing values

Abstract: Background Missing values frequently arise in modern biomedical studies due to various reasons, including missing tests or complex profiling technologies for different omics measurements. Missing values can complicate the application of clustering algorithms, whose goals are to group points based on some similarity criterion. A common practice for dealing with missing values in the context of clustering is to first impute the missing values, and then apply the clustering algorithm on the completed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 19 publications
(13 citation statements)
references
References 40 publications
(49 reference statements)
0
12
0
1
Order By: Relevance
“…We used a standard imputation‐followed‐by‐clustering approach, and it could be argued that neither the filter (imputation) nor the decision (clustering) were optimal. Recent work proposed a model‐based optimal clustering approach for data from microarrays and RNA sequencing (RNA‐seq) by incorporating the missing value mechanism into a random labeled point process and then marginalizing out the missing value process from Boluki, Dadaneh, Qian, and Dougherty (2019). However, it assumes that the data are MCAR.…”
Section: Discussionmentioning
confidence: 99%
“…We used a standard imputation‐followed‐by‐clustering approach, and it could be argued that neither the filter (imputation) nor the decision (clustering) were optimal. Recent work proposed a model‐based optimal clustering approach for data from microarrays and RNA sequencing (RNA‐seq) by incorporating the missing value mechanism into a random labeled point process and then marginalizing out the missing value process from Boluki, Dadaneh, Qian, and Dougherty (2019). However, it assumes that the data are MCAR.…”
Section: Discussionmentioning
confidence: 99%
“…The idea of clustering involves grouping similar data set which can then be used to address the missing vales [78]. A simple type of clustering is "imputation" where missing values are either replaces with zero or average values of row/column [78][79][80].…”
Section: Missing Data Issue For Small Citiesmentioning
confidence: 99%
“…After the workshop, eleven papers [1][2][3][4][5][6][7][8][9][10][11] were accepted for publication in the CNB-MAC 2018 partner journals: BMC Bioinformatics and BMC Genomics. In the following we provide a brief summary of these selected papers.…”
Section: Research Papers Presented At Cnb-mac 2018mentioning
confidence: 99%
“…Missing values can complicate the application of clustering algorithms, whose goal is to group data points based on some similarity criterion. In [2], Boluki et al consider missing values in the context of optimal clustering, which finds an optimal clustering operator with reference to an underlying random labeled point process. They incorporate the missing value mechanism into the random labeled point process, and obtain the optimal clustering operator by marginalizing out the missing-value process.…”
Section: Research Papers Presented At Cnb-mac 2018mentioning
confidence: 99%