Aims
To analyse the predictive capacity of 15 machine learning methods for estimating cardiovascular risk in a cohort and to compare them with other risk scales.
Methods
We calculated cardiovascular risk by means of 15 machine‐learning methods and using the SCORE and REGICOR scales and in 38 527 patients in the Spanish ESCARVAL RISK cohort, with 5‐year follow‐up. We considered patients to be at high risk when the risk of a cardiovascular event was over 5% (according to SCORE and machine learning methods) or over 10% (using REGICOR). The area under the receiver operating curve (AUC) and the C‐index were calculated, as well as the diagnostic accuracy rate, error rate, sensitivity, specificity, positive and negative predictive values, positive likelihood ratio, and number needed to treat to prevent a harmful outcome.
Results
The method with the greatest predictive capacity was quadratic discriminant analysis, with an AUC of 0.7086, followed by Naive Bayes and neural networks, with AUCs of 0.7084 and 0.7042, respectively. REGICOR and SCORE ranked 11th and 12th, respectively, in predictive capacity, with AUCs of 0.63. Seven machine learning methods showed a 7% higher predictive capacity (AUC) as well as higher sensitivity and specificity than the REGICOR and SCORE scales.
Conclusions
Ten of the 15 machine learning methods tested have a better predictive capacity for cardiovascular events and better classification indicators than the SCORE and REGICOR risk assessment scales commonly used in clinical practice in Spain. Machine learning methods should be considered in the development of future cardiovascular risk scales.