2020
DOI: 10.1186/s41512-020-00074-3
|View full text |Cite
|
Sign up to set email alerts
|

State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues

Abstract: How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc 'traditional' approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
156
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
9
1

Relationship

2
8

Authors

Journals

citations
Cited by 162 publications
(158 citation statements)
references
References 138 publications
(143 reference statements)
2
156
0
Order By: Relevance
“…This process results in sparse fits similar to lasso-like approaches, with many of the estimated coefficients shrunk toward zero. Boosting algorithms have been shown to be particularly useful to handle models in which the number of candidate predictors exceeds the number of observations (high-dimensional settings) [51,52].…”
Section: Resultsmentioning
confidence: 99%
“…This process results in sparse fits similar to lasso-like approaches, with many of the estimated coefficients shrunk toward zero. Boosting algorithms have been shown to be particularly useful to handle models in which the number of candidate predictors exceeds the number of observations (high-dimensional settings) [51,52].…”
Section: Resultsmentioning
confidence: 99%
“…Because this last analysis was intended as a descriptive model rather than a predictive model, variables weakly associated with the FATCOD, Form B were not removed. 24 Missing values were handled by multiple imputation: instead of being replaced by a single value, missing values are replaced by several values selected at random from a distribution determined using a model (15 imputations for this study). Statistical analyses were performed using R V.3.6.1 ( www.r-project.org ), and the mice package V.3.5.0 was used for the imputation.…”
Section: Methodsmentioning
confidence: 99%
“…A decision on which variable to include in a multivariate analysis can depend on different criteria, or cutoff (determinants screening signi cance) than the signi cance level [19,20,21]. In addition, some variables can adjust the effects of other variables in the model, even if they do not have any low p-value themselves [22]. In the present study, all the potential risk factors were considered for multivariable analysis presented in Table 3 to detect the direction and extent to which variable categories explain the difference in prevalence of canine helminthiases.…”
Section: Characteristics Of Study Dogsmentioning
confidence: 99%