Background
Helicobacter pylori
has a high infection rate worldwide, and epidemiological study of
H. pylori
is important. Artificial intelligence has been widely used in the field of medical research and has become a hotspot in recent years. This paper proposed a prediction model for
H. pylori
infection based on machine learning in adults.
Material/Methods
Adult patients were selected as research participants, and information on 30 factors was collected. The chi-square test, mutual information, ReliefF, and information gain were used to screen the feature factors and establish 2 subsets. We constructed an
H. pylori
infection prediction model based on XGBoost and optimized the model using a grid search by analyzing the correlation between features. The performance of the model was assessed by comparing its accuracy, recall, precision, F1 score, and AUC with those of 4 other classical machine learning methods.
Results
The model performed better on the part B subset than on the part A subset. Compared with the other 4 machine learning methods, the model had the highest accuracy, recall, F1 score, and AUC. SHAP was used to evaluate the importance of features in the model. It was found that
H. pylori
infection of family members, living in rural areas, poor washing hands before meals and after using the toilet were risk factors for
H. pylori
infection.
Conclusions
The model proposed in this paper is superior to other models in predicting
H. pylori
infection and can provide a scientific basis for identifying the population susceptible to
H. pylori
and preventing
H. pylori
infection.