Background We aimed to construct simple and practical metabolic syndrome (MetS) risk prediction models based on data from inhabitants of Urumqi and to provide a methodological reference for the prevention and control of MetS. Methods This cross-sectional study was conducted in the Xinjiang Uygur Autonomous Region of China. We collected data from inhabitants of Urumqi from 2018 to 2019, including demographic characteristics, anthropometric indicators, living habits, and family history. Resampling techniques were used to address class imbalance in the data, and MetS risk prediction models were then constructed using logistic regression (LR) and decision tree (DT). Nomograms and DT tree diagrams were used to explain and visualize the models. Results Of the 25,542 participants included in the study, 3,267 (12.8%) were diagnosed with MetS and 22,275 (87.2%) were not. Both the LR and DT models built on the random undersampling dataset had good AUROC values (0.846 and 0.913, respectively). The accuracy, sensitivity, specificity, and AUROC values of the DT model were higher than those of the LR model. On the random undersampling dataset, the LR model showed that exercise such as walking (OR = 0.769) and running (OR = 0.736) was a protective factor against MetS, whereas age 60 ~ 74 years (OR = 1.388), previous diabetes (OR = 8.902), previous hypertension (OR = 2.830), fatty liver (OR = 3.306), smoking (OR = 1.541), high systolic blood pressure (SBP; OR = 1.044), and high diastolic blood pressure (DBP; OR = 1.072) were risk factors for MetS. The DT model had 7 layers and 18 leaves, with BMI as the root node and the most important factor affecting MetS, followed in descending order of importance by SBP, previous diabetes, previous hypertension, DBP, fatty liver, smoking, and exercise. Conclusions Both the DT and LR MetS risk prediction models perform well, and each has its own characteristics. Combining the two methods to construct an interpretable MetS risk prediction model can provide a methodological reference for the prevention and control of MetS.
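The random undersampling step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the simplest variant, in which the majority class is randomly reduced to the size of the minority class; the abstract does not specify the exact procedure or software used, and the function name `random_undersample` and the seed are hypothetical. Only the class sizes (3,267 and 22,275) come from the abstract.

```python
import random
from collections import defaultdict


def random_undersample(labels, seed=0):
    """Return sorted row indices with every class reduced to the minority-class size.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    n_min = min(len(indices) for indices in by_class.values())
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    kept = []
    for indices in by_class.values():
        # Sample without replacement; the minority class is kept in full.
        kept.extend(rng.sample(indices, n_min))
    return sorted(kept)


# Class sizes reported in the abstract: 3,267 MetS (1) vs 22,275 non-MetS (0).
labels = [1] * 3267 + [0] * 22275
kept = random_undersample(labels)
balanced = [labels[i] for i in kept]
print(len(kept), sum(balanced))
```

Under this assumption the balanced dataset contains equal numbers of MetS and non-MetS records, at the cost of discarding most of the majority class.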
Objective The internal workings of machine learning algorithms are complex, and such algorithms are regarded as low-interpretability "black box" models, which makes it difficult for domain experts to understand and trust them. This study uses metabolic syndrome (MetS) as an entry point to analyze and evaluate the value of model interpretability methods for predictive models that are hard to interpret. Methods Data were collected from a chain of health examination institutions in Urumqi from 2017 to 2019; 39,134 records remained after preprocessing such as deletion and imputation. Recursive feature elimination (RFE) was used for feature selection to reduce redundancy. MetS risk prediction models (logistic regression, random forest, XGBoost) were built on the selected feature subset, and accuracy, sensitivity, specificity, Youden index, and AUROC were used to evaluate classification performance. Post-hoc model-agnostic interpretation methods (variable importance, LIME) were used to interpret the results of the predictive models. Results RFE selected eighteen physical examination indicators, effectively addressing redundancy in the physical examination data. The random forest and XGBoost models had higher accuracy, sensitivity, specificity, Youden index, and AUROC values than logistic regression, and the XGBoost model had higher sensitivity, Youden index, and AUROC values than random forest. The study used variable importance, LIME, and PDP for global and local interpretation of the best-performing MetS risk prediction model (XGBoost). The different interpretation methods offer different insights into the model results, allow more flexibility in model selection, and can visualize how and why the model makes its decisions.
The interpretable risk prediction model in this study can help to identify risk factors associated with MetS. The results showed that, in addition to traditional risk factors such as overweight and obesity, hyperglycemia, hypertension, and dyslipidemia, MetS was also associated with other factors, including age, creatinine, uric acid, and alkaline phosphatase. Conclusion Applying model interpretability methods to black box models not only preserves flexibility in model application but also compensates for the models' lack of interpretability. Model interpretability methods can also serve as a novel means of identifying variables that are more likely to be good predictors.
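The evaluation metrics named in the abstract above (accuracy, sensitivity, specificity, Youden index) all follow from a binary confusion matrix. The sketch below shows the standard definitions; the counts in the example are made up for illustration and are not results from the study, and the function name `classification_metrics` is hypothetical.

```python
def classification_metrics(tp, fp, tn, fn):
    """Compute standard binary classification metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    youden = sensitivity + specificity - 1  # Youden's J statistic
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "youden": youden,
    }


# Illustrative counts only (not taken from the study).
metrics = classification_metrics(tp=80, fp=30, tn=170, fn=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

The Youden index combines sensitivity and specificity into one number, which is why it is useful when classes are imbalanced and accuracy alone can be misleading.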
Background Close contacts of active pulmonary tuberculosis patients are a high-risk group for tuberculosis. Active screening of contacts can identify more people recently exposed to pulmonary tuberculosis patients, more patients with early-stage pulmonary tuberculosis, and more latent infections. Methods A decision tree-Markov model of close-contact screening strategies was established to simulate a cohort of close contacts of tuberculosis patients. The cost-effectiveness of a passive screening strategy was compared with that of active screening once, twice, and once a year. One-way and probabilistic sensitivity analyses were conducted to test the impact of the model's assumptions and parameter estimates on the cost-effectiveness analysis. Results Compared with passive screening, active screening once, twice, and once a year reduced the incidence of latent infection by 0.21%, 14.286%, and 63.48%, respectively; reduced the incidence of active tuberculosis by 4.13%, 11.22%, and 50.04%, respectively; and reduced the incidence of TB deaths by 2.86%, 5.71%, and 11.43%, respectively. Compared with the passive screening strategy, active screening once, twice, and once a year cost an additional 8800.43 RMB, 5781.70 RMB, and 13825.04 RMB per additional QALY, respectively, which is below the willingness-to-pay of the Chinese population and less than twice the GDP of Xinjiang. The additional cost per QALY gained is therefore worthwhile, and all three are advantageous strategies. Conclusion Compared with the passive screening strategy, the cost and effect of active screening once, twice, and once a year increase in turn; these strategies reduce the incidence of latent tuberculosis infection (LTBI), tuberculosis, and tuberculosis death, and all are advantageous strategies.
Continuous active screening for tuberculosis in key populations is one of the key measures for rapidly reducing the tuberculosis epidemic. Regular screening is recommended for all close contacts of active tuberculosis patients. Health resources should be allocated reasonably, and appropriate screening methods should be proposed for different age groups and key groups.
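The "cost for each additional QALY" reported above is an incremental cost-effectiveness ratio (ICER): the difference in cost between two strategies divided by the difference in QALYs. The sketch below shows that arithmetic and a comparison against a willingness-to-pay threshold. All numbers and both function names are hypothetical; the abstract does not report the underlying costs, QALYs, or threshold value.

```python
def icer(cost_new, cost_ref, qaly_new, qaly_ref):
    """Incremental cost-effectiveness ratio of a new strategy versus a reference."""
    delta_cost = cost_new - cost_ref
    delta_qaly = qaly_new - qaly_ref
    if delta_qaly <= 0:
        # The ratio is not meaningful when the new strategy gains no QALYs.
        raise ValueError("new strategy must gain QALYs for the ICER to be interpretable")
    return delta_cost / delta_qaly


def is_cost_effective(icer_value, threshold):
    """A strategy is treated as cost-effective when its ICER is below the threshold."""
    return icer_value < threshold


# Hypothetical inputs: cost in RMB per person, effect in QALYs per person.
example_icer = icer(cost_new=6000.0, cost_ref=1000.0, qaly_new=10.5, qaly_ref=10.0)
print(example_icer, is_cost_effective(example_icer, threshold=20000.0))
```

In a full analysis the costs and QALYs would come from running the decision tree-Markov cohort simulation for each strategy; this sketch covers only the final comparison step.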
Background Vitamin D is related to human immunity, so we used a Bayesian network model to analyze and infer the relationship between vitamin D level and acid-fast bacilli (AFB) smear positivity after two months of treatment among pulmonary tuberculosis (TB) patients. Methods This is a cross-sectional study. Vitamin D levels were measured and medical records were collected for 731 TB patients from December 2019 to December 2020 in Xinjiang, China. Logistic regression was used to analyze the factors influencing second AFB smear positivity. A Bayesian network was used to further analyze the causal relationship between vitamin D level and second AFB smear positivity. Results Baseline AFB smear positivity (OR = 6.481, 95% CI: 1.604~26.184), combined cavity (OR = 3.204, 95% CI: 1.586~6.472), full supervision (OR = 8.173, 95% CI: 1.536~43.492), and full management (OR = 6.231, 95% CI: 1.031~37.636) were not only risk factors but can also be considered causes of second AFB smear positivity in TB patients (Ensemble > 0.5). There was no causal relationship between vitamin D level and second AFB smear positivity (Ensemble = 0.0709). Conclusions The risk factors for second AFB smear positivity were baseline AFB smear positivity, combined cavity, full supervision, and full management. Vitamin D level in TB patients was not considered one of the causes of AFB smear positivity.
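The odds ratios and 95% confidence intervals reported above are the usual way of presenting logistic regression coefficients. As a minimal sketch of the standard Wald calculation, an OR is the exponential of the coefficient, and the interval is the exponential of the coefficient plus or minus 1.96 standard errors. The coefficient and standard error below are hypothetical, since the abstract reports only the resulting ORs and intervals, and the function name `odds_ratio_with_ci` is an assumption.

```python
import math


def odds_ratio_with_ci(beta, se, z=1.96):
    """Convert a logistic regression coefficient and its standard error
    into an odds ratio with a Wald confidence interval (95% when z = 1.96)."""
    odds_ratio = math.exp(beta)
    lower = math.exp(beta - z * se)
    upper = math.exp(beta + z * se)
    return odds_ratio, lower, upper


# Hypothetical coefficient and standard error, chosen so that the OR equals 2.
or_value, ci_lower, ci_upper = odds_ratio_with_ci(beta=math.log(2.0), se=0.1)
print(f"OR = {or_value:.3f}, 95% CI: {ci_lower:.3f}~{ci_upper:.3f}")
```

Because the interval is symmetric on the log scale, it is asymmetric around the OR itself, which is why wide intervals such as 1.536~43.492 extend much further above the point estimate than below it.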