2007
DOI: 10.1111/j.1467-9531.2007.00180.x
|View full text |Cite
|
Sign up to set email alerts
|

4. Regression with Missing Ys: An Improved Strategy for Analyzing Multiply Imputed Data

Abstract: When fitting a generalized linear model-such as linear regression, logistic regression, or hierarchical linear modeling-analysts often wonder how to handle missing values of the dependent variable Y.If missing values have been filled in using multiple imputation, the usual advice is to use the imputed Y values in analysis. We show, however, that using imputed Ys can add needless noise to the estimates. Better estimates can usually be obtained using a modified strategy that we call multiple imputation, then del… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
826
1
3

Year Published

2011
2011
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 1,287 publications
(833 citation statements)
references
References 27 publications
3
826
1
3
Order By: Relevance
“…Our multiple imputation model considers all analyzed variables in addition to all outcome variables. We used original unimputed values for our outcome variables, as suggested by the literature [32].…”
Section: Analysis Planmentioning
confidence: 99%
“…Our multiple imputation model considers all analyzed variables in addition to all outcome variables. We used original unimputed values for our outcome variables, as suggested by the literature [32].…”
Section: Analysis Planmentioning
confidence: 99%
“…Second, I recode the missing values as dummy variables. Third, I impute missing values on the independent variables using multiple imputed chained equations (Royston 2004;2005;von Hippel 2007). Because the results were not sensitive to the method of handling the missing data used, I report only the results from multiple imputed chained equations.…”
Section: Analytic Strategymentioning
confidence: 99%
“…The imputed values of the dependent variable did not provide additional error. 37 Therefore, we calculated our primary outcomes by using the 3 key variables (quit status, CO reading, and days since the last cigarette) with their imputed values for observed self-classified and 7-day prevalence quit rates from an average of 5 imputation series, which allowed for an SD 5% wider than if we had used an infinite number of imputations.…”
Section: Analysesmentioning
confidence: 99%