2020
DOI: 10.1038/s41598-020-68771-z
|View full text |Cite
|
Sign up to set email alerts
|

Early detection of type 2 diabetes mellitus using machine learning-based prediction models

Abstract: Most screening tests for T2DM in use today were developed using multivariate regression methods that are often further simplified to allow transformation into a scoring formula. The increasing volume of electronically collected data opened the opportunity to develop more complex, accurate prediction models that can be continuously updated using machine learning approaches. This study compares machine learning-based prediction models (i.e. Glmnet, RF, XGBoost, LightGBM) to commonly used regression models for pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
114
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 245 publications
(133 citation statements)
references
References 47 publications
2
114
0
Order By: Relevance
“…We implemented the gradient-boosting predictor trained with the LightGBM [37] Python package. LightGBM has shown effectiveness on clinical and patient tabular data in particular, and was adopted by many recently published models [38][39][40][41][42][43]. Missing values were inherently handled by the LightGBM predictor [37,44,45].…”
Section: Development Of the Modelsmentioning
confidence: 99%
“…We implemented the gradient-boosting predictor trained with the LightGBM [37] Python package. LightGBM has shown effectiveness on clinical and patient tabular data in particular, and was adopted by many recently published models [38][39][40][41][42][43]. Missing values were inherently handled by the LightGBM predictor [37,44,45].…”
Section: Development Of the Modelsmentioning
confidence: 99%
“…While the DT model obtained the best recall values: 90% and 100%, and RF obtained the best AUC value: 83% for the PID dataset, and NB obtained the highest AUC value: 84% for the Kurmitola dataset. In contrast, Kopitar et al [17] utilized several ML models: Glmnet, RF, XGBoost, and LightGBM, to predict undiagnosed type 2 diabetes mellitus [17]. The authors obtained a dataset from EHR in 10 healthcare centers in Slovenia in which 3,723 participant records were applicable in their development to predict type 2 diabetes mellitus [17].…”
Section: A Traditional Machine Learning Techniquesmentioning
confidence: 99%
“…In contrast, Kopitar et al [17] utilized several ML models: Glmnet, RF, XGBoost, and LightGBM, to predict undiagnosed type 2 diabetes mellitus [17]. The authors obtained a dataset from EHR in 10 healthcare centers in Slovenia in which 3,723 participant records were applicable in their development to predict type 2 diabetes mellitus [17]. Their utilized model gained the highest performance regarding the lowest average RMSE value of 83.8% to predicting diabetes [17].…”
Section: A Traditional Machine Learning Techniquesmentioning
confidence: 99%
See 1 more Smart Citation
“…It shows strategic calculations were more important than precision. The restriction was that only Type 2 Diabetes was under consideration [19]. Younus et al proposed an algorithm that is based on random forest and attempted to detect the complicated areas of patients with type 2 diabetes.…”
Section: Literature Reviewmentioning
confidence: 99%