2016
DOI: 10.1080/03610918.2014.983648
|View full text |Cite
|
Sign up to set email alerts
|

Clustered data with small sample sizes: Comparing the performance of model-based and design-based approaches

Abstract: MATHEMATICS SUBJECT CLASSIFICATION J; J ABSTRACTTwo classes of methods properly account for clustering of data: designbased methods and model-based methods. Estimates from both methods have been shown to be approximately equal with large samples. However, both classes are known to produce biased standard error estimates with small samples. This paper compares the bias of standard errors and statistical power of marginal effects for generalized estimating equations (a design-based method) and generalize… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
52
0
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 72 publications
(56 citation statements)
references
References 28 publications
3
52
0
1
Order By: Relevance
“…As has been demonstrated in previous research (e.g., Browne & Draper, 2006;McNeish & Harring, 2015), ML and REML tended to have coverage intervals that are shorter than the nominal rate, especially for predictors involving a variable at Level 2. Use of a Kenward-Roger correction was largely able to address this limitation and provided coverage rates within the nominal  Prior to the DEFT correction, the coverage rate for the treatment effect with  to  observations per cluster was % and with  to  observations per cluster it was % across all number-of-cluster conditions.…”
Section: Multilevel Modelssupporting
confidence: 76%
See 1 more Smart Citation
“…As has been demonstrated in previous research (e.g., Browne & Draper, 2006;McNeish & Harring, 2015), ML and REML tended to have coverage intervals that are shorter than the nominal rate, especially for predictors involving a variable at Level 2. Use of a Kenward-Roger correction was largely able to address this limitation and provided coverage rates within the nominal  Prior to the DEFT correction, the coverage rate for the treatment effect with  to  observations per cluster was % and with  to  observations per cluster it was % across all number-of-cluster conditions.…”
Section: Multilevel Modelssupporting
confidence: 76%
“…Mancl-DeRouen had coverage rates that were much closer to nominal rates but were still a little too short, particularly for predictors at Level 1 with fewer than 10 clusters. Morel-Bokossa-Neerchal performed the best of all the GEE methods although the coverage rates tended to consistently be on the high end of Bradley's range, as was similarly found in McNeish and Harring (2015).…”
Section: Generalized Estimating Equationssupporting
confidence: 60%
“…When the effects were significant (α = 0.05), Tukey's Honest Significant Difference test was performed. Additionally, Kenward-Roger correction was applied for reducing small sample bias [26]. Statistical analyses were conducted using the packages "plyr", "ggplot2", "lme4" and "lmerTest" for the linear models; and the packages: "rmisc", "rcmdmisc", "plyr", "ggplot2", "car", "multcompView" and "lsmeans" for the linear mixed models implemented in the R software version 3.3.3 [27].…”
Section: Statistical Analysesmentioning
confidence: 99%
“…Several variations of CRSEs are available, at least for a single-level model (McNeish, 2014;McNeish & Harring, 2017;Raudenbush & Bryk, 2002). Mplus provides a CRSE procedure referred to as type = complex, which entails fitting a single-level model to data and correcting the standard errors for clustering at a higher level.…”
Section: Three Approaches To Analyze Ms-crt Datamentioning
confidence: 99%
“…A relatively recent approach, cluster bootstrapping, has been shown to produce results similar to those of multilevel modeling (MLM), but the results are computationally demanding, especially within a Monte Carlo simulation study (Huang, 2016). In brief, MLM, as a completely model-based approach, and generalized estimating equations (GEE) and clusterrobust standard errors (CRSE), as design-based approaches, are possible alternatives for analyzing MS-CRT data (McNeish, 2014;McNeish & Harring, 2017;McNeish & Wentzel, 2017). Among these three methods, MLM is currently the predominant method in the field of the social sciences (Bauer & Sterba, 2011;McNeish, Stapleton, & Silverman, 2017).…”
mentioning
confidence: 99%