2019
DOI: 10.1016/j.cardfail.2019.01.018
|View full text |Cite
|
Sign up to set email alerts
|

Predictive Abilities of Machine Learning Techniques May Be Limited by Dataset Characteristics: Insights From the UNOS Database

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

2
51
0
1

Year Published

2019
2019
2025
2025

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 55 publications
(54 citation statements)
references
References 17 publications
2
51
0
1
Order By: Relevance
“…Prior studies have assessed the performance of clinical predictive models, finding, as in our study, that machine learning methods performed equivalently to standard regression analyses . Although advanced analytic methods and traditional regression models have comparable discrimination, model performance is often influenced by both the size of the cohort under study and the number of events per variable (EPV) .…”
Section: Discussionmentioning
confidence: 72%
See 1 more Smart Citation
“…Prior studies have assessed the performance of clinical predictive models, finding, as in our study, that machine learning methods performed equivalently to standard regression analyses . Although advanced analytic methods and traditional regression models have comparable discrimination, model performance is often influenced by both the size of the cohort under study and the number of events per variable (EPV) .…”
Section: Discussionmentioning
confidence: 72%
“…Prior studies have assessed the performance of clinical predictive models, finding, as in our study, that machine learning methods performed equivalently to standard regression analyses. [27][28][29] Although advanced analytic methods and traditional regression models have comparable discrimination, model performance is often influenced by both the size of the cohort under study and the number of events per variable (EPV). [30][31][32][33] Evidence suggests that logistic regression models perform better (in terms of accuracy, parsimony and/or discrimination) in smaller datasets with approximately 20-50 EPV, while random forest models perform well with larger sample sizes and achieve sufficient stability when EPV exceeds 200.…”
Section: Discussionmentioning
confidence: 99%
“…First, large multicenter registries, like the UNOS dataset, were the main cohort source for the derivate models. The lack of granularity of data included in these registries may be the most important limitation 35 . The UNOS Thoracic Committee recently decided to expand the collection of data to capture more prognostic markers in order to improve risk stratification.…”
Section: Discussionmentioning
confidence: 99%
“…The analysis of complex interactions between predictive variables may improve risk stratification (eg, donor age and ischemic time) 36‐38 . However, different machine learning approaches failed to improve the discrimination ability of predictive models 35 . We believe that international collaborations, at the level of centers, to build prospective prediction models based on a deep phenotyped database may increase the granularity of the dataset, heterogeneity of allocation schemes and practices, and finally, the statistical performance of these models.…”
Section: Discussionmentioning
confidence: 99%
“…Our findings are consistent with a study in heart transplantation by Miller et al . [34] that found no meaningful difference in predicting 1‐year survival between logistic regression and ML algorithms using the same set of variables, with C ‐statistics around 0.65 in most methods. We have extended this approach to kidney transplantation, to outcomes beyond 1 year, to Cox regression which is the typical method for evaluating survival, and to nonsurvival outcomes such as DGF and AR.…”
Section: Discussionmentioning
confidence: 99%