Background
Rheumatic heart disease (RHD) accounts for a large proportion of deaths in the intensive care unit (ICU). Early prediction of mortality risk in patients with RHD can support timely and appropriate treatment and improve survival outcomes. The XGBoost machine learning algorithm can be used to identify predictive factors, but its use has so far been limited. We compared the performance of logistic regression and XGBoost in predicting hospital mortality among patients with RHD from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database.

Methods
Patients with RHD in the MIMIC-IV database were retrospectively selected according to the availability and clinical significance of their data and divided into two groups based on whether they survived or died. Backward stepwise regression was used to identify the independent factors influencing the outcome of patients with RHD and to compare the differences between the two groups. The XGBoost algorithm and logistic regression were used to establish two prediction models, and the areas under the receiver operating characteristic curves (AUCs) and decision-curve analysis (DCA) were used to test and compare the models. Finally, DCA and the clinical impact curve (CIC) were used to validate the model.

Results
Data on 1,634 patients with RHD were analyzed, comprising 207 who died during hospitalization and 1,427 who survived. According to the AUCs of the two models [0.838 (95% confidence interval = 0.786–0.891) and 0.815 (95% confidence interval = 0.765–0.865)] and DCA, the logistic regression model performed better. DCA and CIC verified that the logistic regression model had convincing predictive value.

Conclusions
We used logistic regression analysis to establish a more meaningful prediction model for the final outcome of patients with RHD. This model might be clinically useful for patients with RHD and help clinicians to provide detailed treatments and precise management.
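
The two comparison metrics named in the Methods, AUC and the net benefit plotted in a decision curve, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names and the toy labels and probabilities are hypothetical, and nothing here is drawn from MIMIC-IV. It assumes the standard definitions, with AUC as the probability that a randomly chosen death is ranked above a randomly chosen survivor, and net benefit at threshold probability p_t as TP/n - (FP/n) * p_t / (1 - p_t).

```python
def auc(y_true, y_score):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair (Mann-Whitney formulation).
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("both outcome classes must be present")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def net_benefit(y_true, y_score, threshold):
    """Net benefit of treating patients whose predicted risk >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    tp = sum(1 for y, s in zip(y_true, y_score) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(y_true, y_score) if s >= threshold and y == 0)
    return tp / n - (fp / n) * threshold / (1.0 - threshold)


def treat_all_net_benefit(y_true, threshold):
    """Reference strategy in a decision curve: treat every patient."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


if __name__ == "__main__":
    # Hypothetical toy data: 1 = died in hospital, 0 = survived.
    outcomes = [1, 0, 1, 0]
    model_a = [0.8, 0.6, 0.4, 0.2]  # hypothetical predicted risks, model A
    model_b = [0.9, 0.3, 0.7, 0.1]  # hypothetical predicted risks, model B

    for name, scores in (("A", model_a), ("B", model_b)):
        print(name, "AUC:", auc(outcomes, scores))
        for pt in (0.2, 0.5):
            print(name, "net benefit at", pt, ":", net_benefit(outcomes, scores, pt))
    print("treat-all at 0.2:", treat_all_net_benefit(outcomes, 0.2))
```

A decision curve is obtained by evaluating `net_benefit` over a range of thresholds for each model and plotting the results alongside the treat-all and treat-none (net benefit of zero) strategies; the model with the higher curve over the clinically relevant threshold range is preferred.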