2004
DOI: 10.1002/sim.1687
|View full text |Cite
|
Sign up to set email alerts
|

Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses

Abstract: This paper demonstrates an inflation of the type I error rate that occurs when testing the statistical significance of a continuous risk factor after adjusting for a correlated continuous confounding variable that has been divided into a categorical variable. We used Monte Carlo simulation methods to assess the inflation of the type I error rate when testing the statistical significance of a risk factor after adjusting for a continuous confounding variable that has been divided into categories. We found that t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
116
1
5

Year Published

2004
2004
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 197 publications
(124 citation statements)
references
References 24 publications
2
116
1
5
Order By: Relevance
“…Stratification by categories (e.g., 5 categories) of the confounder may adequately control for the confounding by continuous confounders as well, 17 although in extreme settings this may not hold. 21 In our clinical examples, control for continuous confounders was indeed similar when stratifying using 5 strata or when using fractional polynomials or restricted cubic splines. It should be noted that the results from these methods differed compared with adjustment by stratifying on the dichotomized continuous confounder.…”
Section: Discussionmentioning
confidence: 67%
“…Stratification by categories (e.g., 5 categories) of the confounder may adequately control for the confounding by continuous confounders as well, 17 although in extreme settings this may not hold. 21 In our clinical examples, control for continuous confounders was indeed similar when stratifying using 5 strata or when using fractional polynomials or restricted cubic splines. It should be noted that the results from these methods differed compared with adjustment by stratifying on the dichotomized continuous confounder.…”
Section: Discussionmentioning
confidence: 67%
“…There was no evidence to suggest that data were normally distributed, hence in the descriptive statistics for continuous variables, we report median and inter-quartile range. To avoid inflating the type-I error rate, loss of power, residual confounding, and bias, continuous predictor variables were not categorised (Del Priore et al, 1997;Austin and Brunner, 2004;Royston et al, 2006). To test any differences we used a non-parametric Wilcoxon rank sum (Mann -Whitney) test.…”
Section: Discussionmentioning
confidence: 99%
“…Dichotomization is very common in clinical studies, since it satisfies the general need in clinical practice to label individuals as having or not having a characteristic and to decide the treatment strategy. However, converting all patients into two categories has some intrinsic problems, such as loss of information and statistical power, increased probability of false positive results, and impossibility of detecting the shape of the relationship [15][16][17][18]. In addition, in the absence of a priori information, ''optimal'' cutpoints are usually obtained by trying many thresholds and choosing the one which, to some extent, gives the most satisfactory result, for example, choosing the value which minimizes the P-value of the Log-rank test [13].…”
Section: Discussionmentioning
confidence: 99%
“…For example, in studies on MBC, a cutpoint of 5 CTCs has been generally adopted [3,4,13,14]. Dichotomization has some perceived advantages, such as simplicity, easy interpretation of the results, and functionality in clinical practice, but also some intrinsic problems, such as loss of information and statistical power, increased probability of false positive results, and impossibility of detecting nonlinear relationships between the predictor and the outcome [15][16][17][18].…”
Section: Introductionmentioning
confidence: 99%