2021
DOI: 10.1038/s41598-020-79317-8
|View full text |Cite
|
Sign up to set email alerts
|

Model selection for inferential models with high dimensional data: synthesis and graphical representation of multiple techniques

Abstract: Inferential research commonly involves identification of causal factors from within high dimensional data but selection of the ‘correct’ variables can be problematic. One specific problem is that results vary depending on statistical method employed and it has been argued that triangulation of multiple methods is advantageous to safely identify the correct, important variables. To date, no formal method of triangulation has been reported that incorporates both model stability and coefficient estimates; in this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
21
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(22 citation statements)
references
References 28 publications
1
21
0
Order By: Relevance
“…In terms of the substantial improvements in accuracy of covariate selection, the additional time required would appear to be very worthwhile. Also worthy of note in our results was the excellent performance of the combination modelling method; this is in agreement with previous research 19 . For all datasets this method provided the most accurate variable selection and this is likely to be due to the principle of triangulation.…”
Section: Discussionsupporting
confidence: 93%
See 3 more Smart Citations
“…In terms of the substantial improvements in accuracy of covariate selection, the additional time required would appear to be very worthwhile. Also worthy of note in our results was the excellent performance of the combination modelling method; this is in agreement with previous research 19 . For all datasets this method provided the most accurate variable selection and this is likely to be due to the principle of triangulation.…”
Section: Discussionsupporting
confidence: 93%
“…For each method, a conventional approach to covariate selection was conducted followed by implementation of selection stability by bootstrapping 14 . The modelling methods chosen are acknowledged approaches for inferential modelling and were stepwise selection based on Akaike information criterion (sAIC) 17 , stepwise selection based on a modified Bayesian Information Criterion (mBIC) 5 , elastic net regression (enet) 8 , minimax convex penalty regression (MCP) 18 and a combination method that synthesised results from all four methods 19 . These modelling techniques are described in detail in Section 2.1 and the new approach to stability selection, including the determination of a threshold for inference, is described in Section 2.2.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Triangulation (Lawlor et al, 2017;Lima et al, 2021) of results from elastic net regression (Enet), a form of regularized logistic regression, and logistic regression using modified Bayesian information criterion (mBIC) was used to establish important risk factors for lameness. These methods were chosen due to the large number of predictors and the need to avoid overfitting.…”
Section: Variable Selection Modelsmentioning
confidence: 99%