1999
DOI: 10.1177/096228029900800102
|View full text |Cite
|
Sign up to set email alerts
|

Multiple imputation: a primer

Abstract: In recent years, multiple imputation has emerged as a convenient and flexible paradigm for analysing data with missing values. Essential features of multiple imputation are reviewed, with answers to frequently asked questions about using the method in practice.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

17
1,952
0
33

Year Published

2005
2005
2017
2017

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 2,843 publications
(2,002 citation statements)
references
References 19 publications
17
1,952
0
33
Order By: Relevance
“…Sample sizes for analyses before MI ranged from N  = 46–93, and G*Power a priori power analyses estimated that with a small effect size (.20), an N  = 80–152 was needed for the repeated measures ANOVA, and N  = 386 for the MANOVA. Thus, missing data were replaced through multiple imputation, a frequently used process which replaces missing data through imputing, analysing, and pooling missing data (Schafer, 1999). Multiple imputation is a recommended process for handling missing data regardless of the type of missing data (that is, missing at random, missing completely at random, or missing not at random; Schafer, 1999).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Sample sizes for analyses before MI ranged from N  = 46–93, and G*Power a priori power analyses estimated that with a small effect size (.20), an N  = 80–152 was needed for the repeated measures ANOVA, and N  = 386 for the MANOVA. Thus, missing data were replaced through multiple imputation, a frequently used process which replaces missing data through imputing, analysing, and pooling missing data (Schafer, 1999). Multiple imputation is a recommended process for handling missing data regardless of the type of missing data (that is, missing at random, missing completely at random, or missing not at random; Schafer, 1999).…”
Section: Methodsmentioning
confidence: 99%
“…Thus, missing data were replaced through multiple imputation, a frequently used process which replaces missing data through imputing, analysing, and pooling missing data (Schafer, 1999). Multiple imputation is a recommended process for handling missing data regardless of the type of missing data (that is, missing at random, missing completely at random, or missing not at random; Schafer, 1999). All analyses and multiple imputation procedures were conducted through IBM SPSS Statistics version 24.…”
Section: Methodsmentioning
confidence: 99%
“…Multiple imputation forms a set of complete datasets based on an imputation model, then uses an analytic model to assess intervention effects on each of the completed datasets. The imputation model used to replace the missing data should always be at least as complex as the analytic model used to examine intervention impact (Collins et al, 2001;Graham et al, 2006Graham et al, , 2007Schafer, 1997Schafer, , 1999Schafer and Graham, 2002).…”
Section: Handling Missing Data In Itt Analyses In Multilevel Rfts-mismentioning
confidence: 99%
“…These complete datasets are then analyzed using standard statistical methods, and inferences on such statistics as the odds ratio for GBG versus internal GBG control DISC diagnoses, are made by accounting for two sources of variation: the average standard errors of the odds ratios (within variation) and the variation in these odds ratios across the multiple imputations standard errors (between variation). Confidence intervals can also be formed according to Rubin (1987Rubin ( , 1996 and Schafer (1997Schafer ( , 1999.MI has some advantages over FIML since it can use this additional information to impute values from a large number of observed extra variables that never appear in the final analysis. FIML can also be used with a modest number of extra variables, collapsing over those not used in the final model as we did in Table 6.…”
Section: Handling Missing Data In Itt Analyses In Multilevel Rfts-mismentioning
confidence: 99%
“…In contrast, the second option can be realised through the direct likelihood approach, which is the likelihood-based way of using only the available information, see [19]. Various other (mostly nonparametric) methods of using only the observed data are discussed in [24], and for single and multiple imputation techniques in [22,24,28,29,30,31]. As mentioned in the introduction, the fourth option becomes necessary in the case of non-ignorability and MNAR.…”
Section: Approaches To the Analysis Of Recurrent Event Data With Dropoutmentioning
confidence: 99%