2020
DOI: 10.1109/access.2020.3035026
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning-Based Application for Predicting Risk of Type 2 Diabetes Mellitus (T2DM) in Saudi Arabia: A Retrospective Cross-Sectional Study

Abstract: Earlier detection of individuals at the highest risk of developing diabetes is crucial to avoid the disease's prevalence and progression. Therefore, we aim to build a data-driven predictive application for screening subjects at a high risk of developing Type 2 Diabetes mellitus (T2DM) in the western region of Saudi Arabia. In this context, we designed and implemented a questionnairebased cross-sectional study using conventional diabetes risk factors for studying the prevalence and the association between the o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
32
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(32 citation statements)
references
References 62 publications
0
32
0
Order By: Relevance
“…RF uses a set of homogenous decision trees as its base classifiers while the EMV classifier was composed of the three simple learners LR, SVM, and DT, and used hard voting that considered the majority for predicting the class label for each instance in the test set. The rationale for choosing these is based on their previous performance reports in similar situations [9,31,32]. As our objective was to understand the factors contributing to the classification, we chose not to use any neural networks-based classifier in our analysis due to their "black-box" nature of interpretation of the model [34,57].…”
Section: Selection Of Machine Learning Classifiersmentioning
confidence: 99%
See 1 more Smart Citation
“…RF uses a set of homogenous decision trees as its base classifiers while the EMV classifier was composed of the three simple learners LR, SVM, and DT, and used hard voting that considered the majority for predicting the class label for each instance in the test set. The rationale for choosing these is based on their previous performance reports in similar situations [9,31,32]. As our objective was to understand the factors contributing to the classification, we chose not to use any neural networks-based classifier in our analysis due to their "black-box" nature of interpretation of the model [34,57].…”
Section: Selection Of Machine Learning Classifiersmentioning
confidence: 99%
“…These changes have led to an increasing rate of chronic diseases. Many studies conducted to address the rapid growth of Diabetes Mellitus have either the objective to quantify the status of diabetics in the country [3,5], identifying the most frequently performed self-care behaviors [6], identifying factors related to diabetes control [7], or apply mathematical [8] or machine learning models for diabetes prediction [9]. All these efforts are related to the increasing demand to enhance healthcare quality and control the elevated growth rate of diabetes in the kingdom.…”
Section: Introductionmentioning
confidence: 99%
“…However, Syed and Khan [57] developed utilized the Chi-Squared analysis and binary LR for analyzing and screening the most crucial diabetes uncertainty circumstance for T2DM risk prediction after utilizing the PID and NHNES datasets [57]. Furthermore, they also implemented a two-class decision forest model based on ensemble learning [57]. Their implemented model: decision forest obtained a satisfactory accuracy value of 82.1%, the precision value of 77.6%, recall value of 89%, F1-score value of 82.9%, and AUC value of 86.7%, respectively, for predicting T2DM.…”
Section: Ensemble Learning Techniquesmentioning
confidence: 98%
“…Their generated approach obtained an accuracy value of 90%, a recall value of 90.2%, and a precision value of 94.9% for predicting diabetes. However, Syed and Khan [57] developed utilized the Chi-Squared analysis and binary LR for analyzing and screening the most crucial diabetes uncertainty circumstance for T2DM risk prediction after utilizing the PID and NHNES datasets [57]. Furthermore, they also implemented a two-class decision forest model based on ensemble learning [57].…”
Section: Ensemble Learning Techniquesmentioning
confidence: 99%
“…The important advantage of diabetics care by collaboration team is to increase the efficiency of treatment of diabetics. Syed et al [12] proposed T2DM patient care. Experts evaluate the diet and begin with a retrospective evaluation.…”
Section: Literature Workmentioning
confidence: 99%