2018
DOI: 10.1177/0962280218784726
|View full text |Cite
|
Sign up to set email alerts
|

Sample size for binary logistic prediction models: Beyond events per variable criteria

Abstract: Binary logistic regression is one of the most frequently applied statistical approaches for developing clinical prediction models. Developers of such models often rely on an Events Per Variable criterion (EPV), notably EPV ≥10, to determine the minimal sample size required and the maximum number of candidate predictors that can be examined. We present an extensive simulation study in which we studied the influence of EPV, events fraction, number of candidate predictors, the correlations and distributions of ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

7
341
0
6

Year Published

2019
2019
2021
2021

Publication Types

Select...
7
2

Relationship

3
6

Authors

Journals

citations
Cited by 436 publications
(354 citation statements)
references
References 68 publications
7
341
0
6
Order By: Relevance
“…For studies validating prognostic models, there is no solid sample size recommendation, but it is recommended to consider at least the number of predictors, the total sample size and the event fraction. [42][43][44] The low number of events in our study might have . The horizontal gray line is the net benefit when all RSVinfected hospitalized adults are considered as not having the poor outcome; vertical gray line is the net benefit when all RSV-infected hospitalized adults are considered as having the poor outcome.…”
Section: Discussionmentioning
confidence: 79%
See 1 more Smart Citation
“…For studies validating prognostic models, there is no solid sample size recommendation, but it is recommended to consider at least the number of predictors, the total sample size and the event fraction. [42][43][44] The low number of events in our study might have . The horizontal gray line is the net benefit when all RSVinfected hospitalized adults are considered as not having the poor outcome; vertical gray line is the net benefit when all RSV-infected hospitalized adults are considered as having the poor outcome.…”
Section: Discussionmentioning
confidence: 79%
“…First, we had a limited amount of patients with the primary outcome. For studies validating prognostic models, there is no solid sample size recommendation, but it is recommended to consider at least the number of predictors, the total sample size and the event fraction . The low number of events in our study might have resulted in biased and less precise performance measures, which is also indicated by the broad CI of the reported C ‐statistics.…”
Section: Discussionmentioning
confidence: 98%
“…In our simulation without noise predictors, ridge MLR tended to yield models with better calibration and overall performance than lasso MLR. Though, the relative predictive performance of lasso MLR compared to ridge MLR may improve when the number of noise variables increases, as has recently been shown for binary logistic regression …”
Section: Discussionmentioning
confidence: 98%
“…While no consensus currently exists on the sample sizes required for prognostic prediction, the required sample size depends on several factors, including the number of predictors, total sample size, and number or proportion of events 5859. Thus, large sample sizes could be needed to reliably develop a prognostic model, especially when tens or hundreds of candidate predictor variables are considered.…”
Section: Threats To the Viability Of Rct Data Usementioning
confidence: 99%