Background
Lymph node metastasis (LNM) is an important prognostic factor for cervical cancer (CC) and determines the treatment strategy. Hematological indicators have been reported as being useful biomarkers for the prognosis of a variety of cancers. This study aimed to evaluate the feasibility of machine learning models characterized by preoperative hematological indicators to predict the LNM status of CC patients before surgery.
Methods
The clinical data of 236 patients with pathologically confirmed CC were retrospectively analyzed at the Gynecology Oncology Department of the First Affiliated Hospital of Bengbu Medical College from November 2020 to August 2022. Recursive feature elimination (RFE) was used to select 12 features from 35 hematological indicators and for the construction of 6 machine learning predictive models, including Adaptive Boosting (AdaBoost), Gaussian Naive Bayes (GNB), and Logistic Regression (LR), as well as Random Forest (RF), Support Vector Machines (SVM), and Extreme Gradient Boosting (XGBoost). Evaluation metrics of predictive models included the area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, and F1-score.
Results
There was no significant difference in the 10-fold cross-validated AUC between XGBoost and RF (XGBoost, AUC = 0.903 vs. RF, AUC = 0.908; P = 0.889, DeLong test). XGBoost showed the best overall predictive performance. The specific performance indicators of XGBoost were AUC (0.903, 95% CI: 0.829–0.977), accuracy (0.841, 95% CI: 0.759–0.923), precision (0.850, 95% CI: 0.774–0.926), recall (0.837, 95% CI: 0.755–0.919), and F1-score (0.832, 95% CI: 0.739–0.925).
Conclusions
XGBoost and RF based on preoperative hematological indicators that are easily available in clinical practice showed superior performance in the preoperative prediction of CC LNM. However, investigations on larger external cohorts of patients are required for further validation of our findings.