2013
DOI: 10.4236/ojs.2013.35043
|View full text |Cite
|
Sign up to set email alerts
|

Multiple Imputation of Missing Data: A Simulation Study on a Binary Response

Abstract: Currently, a growing number of programs become available in statistical software for multiple imputation of missing values. Among others, two algorithms are mainly implemented: Expectation Maximization (EM) and Multiple Imputation by Chained Equations (MICE). They have been shown to work well in large samples or when only small proportions of missing data are to be imputed. However, some researchers have begun to impute large proportions of missing data or to apply the method to small samples. A simulation was… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
21
0
1

Year Published

2016
2016
2022
2022

Publication Types

Select...
10

Relationship

1
9

Authors

Journals

citations
Cited by 32 publications
(23 citation statements)
references
References 28 publications
1
21
0
1
Order By: Relevance
“…The present findings are consistent with other research that has simulated MCAR and MAR binary data in randomized trials and found minimal bias with MI (Hardt et al 2013) and little difference between MI and CCA (Caille et al, in press, Ma et al 2011). However, these studies did not test MNAR conditions, where MI consistently outperformed CCA in the present study, nor did they test LOCF or WCS approaches.…”
Section: Discussionsupporting
confidence: 93%
“…The present findings are consistent with other research that has simulated MCAR and MAR binary data in randomized trials and found minimal bias with MI (Hardt et al 2013) and little difference between MI and CCA (Caille et al, in press, Ma et al 2011). However, these studies did not test MNAR conditions, where MI consistently outperformed CCA in the present study, nor did they test LOCF or WCS approaches.…”
Section: Discussionsupporting
confidence: 93%
“…Even the final test we conducted in the SPECTF heart dataset is not fully exhaustive of what a researcher may encounter in the real-life, as we considered only a MCAR mechanism to create the missing data. The conclusions we draw applies to cases with moderate sizes of missingness, no lower than 15 % and no higher than 30 %; we intentionally limited our evaluations to this range as for small amounts of missing data, under the MAR or MCAR mechanisms, imputation may be useless and for larger amounts caution should always be applied because estimates may become very imprecise [21]. Thus, despite the efficiency of NN imputation under these conditions, it should remembered that imputation should be carefully applied and cannot solve all the problems of incomplete data [22] and that NN imputation can have serious drawbacks as we showed for instance considering the risk of distorting data distribution or the lack of precision in imputing variables with no dependencies in a dataset or, conversely, the possibility to introduce spurious associations considering dependencies where they do not exist.…”
Section: Discussionmentioning
confidence: 99%
“…We assumed the mechanism leading to missing values to be missing at random (MAR), and therefore integrated multiple imputations into the analyses to minimize bias stemming from missing data. This method was very well suited to this task, regarding the sample size, the number of variables included in the imputation model, and the analyses to be conducted [ 61 , 62 , 63 , 64 ]. The imputation model contained raw data for all variables used in the analysis, before items were combined into scales or dichotomized.…”
Section: Methodsmentioning
confidence: 99%