“…The maximally selected rank statistics RF outperformed the other random forest models. In case of feature selection, RF min depth filter produced most accurate models NA | 40 | Tan et al [ 55 ] | 2023 | To develop a reliable ML model using socio-demographics, vascular risk factors, and structural neuroimaging markers for early diagnosis of cognitive impairement in multi-ethnic Asian population | 911 participants from Epidemiology of Dementia in Singapore study | Neurodegenrative Disease | ML models- logistic regression, support vector machine, gradient boosting machine voting ensemble- SHAP | According to the voting ensemble, the important predictors of cognitive impairement are age, ethnicity, education attainment, and structural neuroimaging | LR (accuracy = 0.71, F1 = 0.78, AUC = 0.69, FPR = 0.38, sensitivity = 0.75, specificity = 0.62, PPV = 0.81, NPV = 0.54), SVM (accuracy = 0.74, F1 = 0.81, AUC = 0.71, FPR = 0.40, sensitivity = 0.81, specificity = 0.60, PPV = 0.81, NPV = 0.59), GBM (accuracy = 0.73, F1 = 0.79, AUC = 0.73, FPR = 0.29, sensitivity = 0.74, specificity = 0.71, PPV = 0.85, NPV = 0.56), Ensemble (accuracy = 0.83, F1 = 0.87, AUC = 0.80, FPR = 0.26, sensitivity = 0.86, specificity = 0.74, PPV = 0.88, NPV = 0.72) |
41 | Hu et al [ 56 ] | 2021 | To build a prediction model based on ML for cognitive impairement among Chinese community dwelling elderly people with normal cognition | 6718 individuals of age > 60, with MMSE score > = 18, not having any severe disease from the Chinese Longitudinal Health Longevity Survey (CLHLS) | Neurodegenrative Disease | To access 3-year risk of developing cognitive impairement, Ml models used- Random forest, XGBoost, Naïve Bayes, Logistic regression A nomogram was established to vividly present the prediction model | Features selected to develop model- age, instrumental activities of daily living, marital status, and baseline cognitive function Older people with nomogram score less than 170 are considered to have a low 3-year risk, and more than 173 are considered at higher risk | AUC, optimal cut off, sensitivity, specificity, accuracy, specificity/sensitivity values were reported for logisitc regression, random forest, naïve bayes, XG Boost both for validation dataset and test dataset |
42 | Fukunishi et al [ 57 ] | 2020 | To predict the risk of Alzheimer-type dementia for persons aged over 78 without receiving long-term care services using regularly collected claim data | 48,123 persons from claim data of health insurance and long-term care insurance in Japan | Ne... |
…”