2015
DOI: 10.5784/31-1-162
|View full text |Cite
|
Sign up to set email alerts
|

The impact of pre-selected variance inflation factor thresholds on the stability and predictive power of logistic regression models in credit scoring

Abstract: Standard Bank, South Africa, currently employs a methodology when developing application or behavioural scorecards that involves logistic regression. A key aspect of building logistic regression models entails variable selection which involves dealing with multicollinearity. The objective of this study was to investigate the impact of using different variance inflation factor 1 (VIF) thresholds on the performance of these models in a predictive and discriminatory context and to study the stability of the estim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(19 citation statements)
references
References 18 publications
0
19
0
Order By: Relevance
“…Given the dynamic and time-sensitive nature of care exchange, we are also careful to report and discuss the results in terms of association rather than causality. In both sets of models, the variance inflation factor test was conducted and the variance inflation factor values were well below the conservative threshold of 2.5 for all our key variables (de Jongh et al, 2015).…”
Section: Methodsmentioning
confidence: 97%
“…Given the dynamic and time-sensitive nature of care exchange, we are also careful to report and discuss the results in terms of association rather than causality. In both sets of models, the variance inflation factor test was conducted and the variance inflation factor values were well below the conservative threshold of 2.5 for all our key variables (de Jongh et al, 2015).…”
Section: Methodsmentioning
confidence: 97%
“…Applying both the VIF and correlation thresholds was felt to sufficiently address any multicollinearity and the thresholds acceptable for large sample sizes. 25 The final best model was created using only the training data sample.…”
Section: Methodsmentioning
confidence: 99%
“…For FFD and SRD, log-transformed values were analyzed to fulfill regression model assumptions, including random distribution of residuals and good fit over the entire scale of the dependent variable. The multicollinearity was inspected with the vif-function (variance inflation factor) of the car-package and was not found to be problematic for any of the analyses (< 3.1;De Jongh et al, 2015). The observed statistical power (simr -package; Green & MacLeod, 2016) in the lmer-analyses was optimal (90-95%) for testing two-way interactions, but low for testing the three-way interaction (30-40%).…”
Section: Analysesmentioning
confidence: 99%