Objective
This study aims to integrate body composition data in developing a random survival forest (RSF) model for predicting overall survival (OS) in locally advanced cervical cancer (LACC) patients treated with definitive chemoradiotherapy (CRT). Methods: 374 LACC patients from the First Affiliated Hospital of Soochow University were enrolled in the study. Eleven features were selected using least absolute shrinkage and selection operator (LASSO) regression in constructing the RSF model. The Optuna algorithm was applied to optimize the hyperparameters. The constructed RSF model was evaluated based on the C-index, AUC, Brier scores, and decision curve analysis (DCA). Furthermore, we compared the RSF model with the traditional Cox regression and deep learning (Deepsurv) models.
Results: The C-index of the RSF model in the test set was 0.809 (95%CI: 0.708–0.911), superior to the Cox regression (0.739, 95%CI: 0.627–0.852) and Deepsurv (0.774, 95%CI: 0.656–0.892) models. The Brier scores for 1-, 3-, and 5-year OS in the test set (0.044, 0.069, and 0.096) and DCA demonstrated that the RSF model has good calibration and clinical benefits. Furthermore, skeletal muscle density (SMD) ranked top in visualization using SHapley Additive exPlanations (SHAP) plotting. Lastly, we applied the RSF model in risk stratification, which showed a favorable performance.
Conclusions: The RSF model, which is based on body composition characteristics, performs well in predicting OS in LACC patients treated with definitive CRT. Moreover, the RSF model demonstrates value in risk stratification.