“…The publicly available dataset are mentioned in bold Sample number | ML models | Refs. |
Early diagnosis and prediction of diabetes |
T2DM |
15,005 subjects with age ≥ 3 | XGBoost, DNN, and RF | [ 63 ] |
1512 subjects | LR, RF, Naive Bayes (NB), SVM, XGBT, ANN, K-nearest neighbor (KNN), DT, XceptionResNet 50, DenseNet121, Vgg16, Vgg19, and InceptionV3, Stacking model of non-invasive variables and the Resnet50 model | [ 53 ] |
530 participants: 272 were diabetic patients and 258 were non-diabetic patients | Deep autoencoder learning algorithm with CNN networks and deep radial basis function neural network (RBFNN) classifier | [ 52 ] |
217 participants with diabetes, prediabetes and normal conditions | SVM, K-nearest neighbors, RF, XGBoost, hybrid feature selection-XGBoost | [ 91 ] |
2371 T1-weighted whole-body MRI data sets | DenseNet architecture | [ 54 ] |
8454 subjects over five years of follow- up | XGBoost, SVM, LR, RF, and ensemble algorithms | [ 64 ] |
16,429 men and non-pregnant women ≥ 20 years of age | ANN, LR, and RF models | [ 55 ] |
453,487 T2DM patients | Reverse engineering and forward simulation (REFS) | [ 124 ] |
82 obese women (40 non-diabetic and 42 diabetes) | Separability-correlation measure (SCM) and ANN | [ 57 ] |
13,309 Canadian patients | GBM and LR | [ 92 ] |
Kaggle diabetes dataset | RF | ... |
…”