2020
DOI: 10.12720/jait.11.2.78-83
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of Statistical Logistic Regression and RandomForest Machine Learning Techniques in Predicting Diabetes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
13
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 53 publications
(15 citation statements)
references
References 17 publications
1
13
0
1
Order By: Relevance
“…CatBoost is commonly utilized in the fields of business (60), financial assessments (61), Medicare fraud detection (62), environmental science (63,64), and public science (36). According to our review of the literature, in the field of medicine, the random forest model has retained a competitive edge and is often superior in the prediction and classification of medical conditions compared with traditional logistic regression methods and machine learning methods such as neural networks, SVMs, and decision trees (65)(66)(67)(68). In our study, the CatBoost model outperformed the random forest model in classifying nonrecurrent and recurrent stroke.…”
Section: Discussionmentioning
confidence: 99%
“…CatBoost is commonly utilized in the fields of business (60), financial assessments (61), Medicare fraud detection (62), environmental science (63,64), and public science (36). According to our review of the literature, in the field of medicine, the random forest model has retained a competitive edge and is often superior in the prediction and classification of medical conditions compared with traditional logistic regression methods and machine learning methods such as neural networks, SVMs, and decision trees (65)(66)(67)(68). In our study, the CatBoost model outperformed the random forest model in classifying nonrecurrent and recurrent stroke.…”
Section: Discussionmentioning
confidence: 99%
“…We used the random forest model to make our diabetes risk assessment map, compared it with the assessment results of logistic regression, and noted that the assessment result was consistent with the actual prevalence. Thus, we conclude that the random forest model can achieve greater accuracy in assessing diabetes risk [ 41 ]. However, binary logistic regression analysis can intuitively explain diabetes risk factors, which is a disadvantage of the random forest model.…”
Section: Discussionmentioning
confidence: 99%
“…ROC: Receiver Operating Characteristic (ROC) Curve: It is a graphically way to display true positives versus false-positives across a series of cut-offs, and select the optimal cut off for clinical use. ( 13 )…”
Section: Methodsmentioning
confidence: 99%