Purpose
Maintenance hemodialysis (MHD), which can cause various complications, is a common alternative therapy for patients with ESRD. This research built a prediction model of hypoproteinemia among ESRD patients based on machine learning algorithms.
Method
A total of 468 patients were selected as subjects. The “hypoproteinemia risk factor data extraction table” was drawn up after a literature review. Univariate analysis was used to screen independent risk factors as prediction variables. After hyper parameter adjustment by k-fold (k = 5) cross-validation and grid search, random forest (RF), support vector machine (SVM), back propagation (BP) neural network and logistic regression (LR) prediction models were developed. The model was evaluated by 6 dimensions, including AUROC, accuracy, precision, sensitivity, specificity and F1 score, and an importance matrix diagram was used to describe the importance.
Result
The incidence of hypoproteinemia in total was 30.8%. According to univariate analysis, the difference between the hypoproteinemia and nonhypoproteinemia groups was significant in 18 aspects, including age, weight, dialysis duration, and dialysis frequency. In the training set, the AUROC values of the RF, SVM, and LR models were all greater than 0.8 unlike the BP neural network (0.798). The RF model had the highest AUC value (0.924). The specificities of the LR and RF models were similar (0.846 and 0.839, respectively), while the RF model had the best accuracy (0.924) and balanced F1 score (0.751). The models had higher performance indexes in the test set than in the training set, with the RF and BP models performing better in AUROC (0.981, 0.948) and the RF model being better in accuracy, specificity balanced F1 score and precision. The top 5 prediction variables were hypersensitivity C reactive protein, age, weight, usage of high-throughput dialyzers, and dialysis age.
Conclusion
The RF model performed best. The model could help recognize characteristics related to hypoproteinemia during clinical practice, thereby enhancing nurses’ risk perception and improving accurate screening, primary prevention and early intervention.