2016
DOI: 10.4172/2161-1165.1000227
|View full text |Cite
|
Sign up to set email alerts
|

Multicollinearity in Regression Analyses Conducted in Epidemiologic Studies

Abstract: The adverse impact of ignoring multicollinearity on findings and data interpretation in regression analysis is very well documented in the statistical literature. The failure to identify and report multicollinearity could result in misleading interpretations of the results. A review of epidemiological literature in PubMed from January 2004 to December 2013, illustrated the need for a greater attention to identifying and minimizing the effect of multicollinearity in analysis of data from epidemiologic studies. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

7
533
3
5

Year Published

2017
2017
2023
2023

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 754 publications
(596 citation statements)
references
References 29 publications
7
533
3
5
Order By: Relevance
“…It seems that for the research question being discussed here the inclusion of years worked is an inappropriate approach to adjust for left truncation. Rather, the simultaneous inclusion of two highly correlated variables in the model results in biased risk estimates (Yoo et al 2014;Vatcheva et al 2016).…”
Section: Description Of the Study And Authors' Main Resultsmentioning
confidence: 99%
“…It seems that for the research question being discussed here the inclusion of years worked is an inappropriate approach to adjust for left truncation. Rather, the simultaneous inclusion of two highly correlated variables in the model results in biased risk estimates (Yoo et al 2014;Vatcheva et al 2016).…”
Section: Description Of the Study And Authors' Main Resultsmentioning
confidence: 99%
“…To account for confounders, potential risk factors (age, gender, and educational and occupational level) with p < 0.25 in bivariate analysis were further assessed with a multivariable logistic regression analysis according to the method suggested by Bursac et al [15]. We assessed for multicollinearity among the independent variables included in the multivariable analysis (age, BMI [as a continuous variable], and diastolic and systolic blood pressures [BP]) using the variance inflation factor [16]. The highest variance inflation factor (i.e., 2.4) was noted between age (as the dependent variable) and systolic and diastolic BP (as independent variables), and BMI (as the dependent variable) and systolic and diastolic BP in the independent variable list.…”
Section: Methodsmentioning
confidence: 99%
“…Model 3 consisted of sociodemographic, behavioural and biological factors including HIV, and number of pregnancies (only in female model). Multicollinearity was assessed using the Variance Inflation Factor (VIF) [34]. In general, a VIF ≥ 5 suggests multicollinearity between variables [35].…”
Section: Methodsmentioning
confidence: 99%