2021
DOI: 10.31234/osf.io/uaezh
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Best Practices for Addressing Missing Data through Multiple Imputation

Abstract: A common challenge in developmental research is the amount of incomplete and missing data that occurs from respondents failing to complete tasks or questionnaires, as well as from disengaging from the study (i.e., attrition). This missingness can lead to biases in parameter estimates and, hence, in the interpretation of findings. These biases can be addressed through statistical techniques that adjust for missing data, such as multiple imputation. Although this technique is highly effective, it has not been wi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 69 publications
0
8
0
Order By: Relevance
“…Discharge status (yes/no) was modeled as a covariate predictor of all endogenous variables and allowed to covary with the MST arbitrary codes; discharge status was the only demographic characteristic with a significant bivariate association with suicidal ideation. We used full information maximum likelihood to estimate the model using all available data, which is preferred to listwise deletion (Woods et al, 2021). Indirect effects were estimated in Mplus using the Model Indirect command, and 95 percentile confidence intervals using 10,000 bootstrap samples were generated.…”
Section: Methodsmentioning
confidence: 99%
“…Discharge status (yes/no) was modeled as a covariate predictor of all endogenous variables and allowed to covary with the MST arbitrary codes; discharge status was the only demographic characteristic with a significant bivariate association with suicidal ideation. We used full information maximum likelihood to estimate the model using all available data, which is preferred to listwise deletion (Woods et al, 2021). Indirect effects were estimated in Mplus using the Model Indirect command, and 95 percentile confidence intervals using 10,000 bootstrap samples were generated.…”
Section: Methodsmentioning
confidence: 99%
“…Participants who had at least one missing data point had lower reading and phonological awareness scores (small effect size) and lower spelling scores (medium effect size) at the start of grade 3 than participants with complete data, i.e., the data were Missing at Random. Since listwise deletion would drastically reduce the sample size and bias the estimates (Woods et al 2021), we imputed the missing data using predictive mean matching in mice (multiple imputation by chained equations) (Van Buuren & Groothuis-Oudshoorn 2011). We imputed 10 datasets with 10 iterations each and saved the final model's final iteration as the dataset we worked with.…”
Section: Methodsmentioning
confidence: 99%
“…the data was Missing at Random. Since listwise deletion would drastically reduce the sample size and bias the estimates (Woods et al 2021), we imputed the missing data using predictive mean matching in mice (multiple imputation by chained equations) (van Buuren & Groothuis-Oudshoorn 2011). We imputed 10 datasets with 10 iterations each and saved the final model's final iteration as the dataset we worked with.…”
Section: Methodsmentioning
confidence: 99%