A R T I C L E I N F O Keywords: (MeSH terms): Bias Cluster analysis Multivariable analysis Randomized controlled trials as topic Research design A B S T R A C TIndividual-level baseline covariate imbalance could happen more frequently in cluster randomized trials, and may influence the observed treatment effect. Using computer and real-data simulations, this paper quantifies the extent and impact of covariate imbalance on the estimated treatment effect for both continuous and binary outcomes, and relates it to the degree of imbalance for different numbers of clusters, cluster sizes, and covariate intraclass correlation coefficients. We focused on the impact of race as a covariate, given the emphasis of regulatory and funding bodies on understanding the influence of demographic characteristics on treatment effectiveness. We found that bias in the treatment effect is proportional to both the degree of baseline covariate imbalance and the covariate effect size. Larger numbers of clusters result in lower covariate imbalance, and increasing cluster size is less effective in reducing imbalance compared to increasing the number of clusters. Models adjusted for important baseline confounders are superior to unadjusted models for minimizing bias in both model-based simulations and an innovative simulation based on real clinical trial data. Higher outcome intraclass correlation coefficients did not affect bias but resulted in greater variance in treatment estimates. outcomes) or odds ratio (for binary outcomes) suffices as a measure of treatment effect. In practice, results for many CRTs have also been reported using similar (unadjusted) estimators [4,5]. However, the internal validity of CRTs is challenged when treatment arms are not comparable. Brierley et al. [6] found that about 30% of CRTs may suffer from selection bias, resulting in substantial covariate imbalance. Previous literature recommended that researchers could control selection bias in CRTs via cautious design, such as identifying patients prior to randomization and blinding [7][8][9]. Aside from selection bias, another important source of covariate imbalance is random chance, and that is the focus of the current paper. We show that baseline covariate imbalance may happen more frequently in CRTs compared to RCTs. If the covariates are predictive of the clinical outcomes, they act as confounders and may cause a spurious association between intervention and clinical outcome when not accounted for properly. As a result, it is important to understand how to accurately account for important individual-level covariates in the design and analysis of CRTs where confounders typically can impact the outcome.