Machine Learning Technique to Prognosis Diabetes Disease: Random Forest Classifier Approach

Palimkar, Prajyot; Shaw, Rabindra Nath; Ghosh, Ankush

doi:10.1007/978-981-16-2164-2_19

Cited by 126 publications

(39 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Random Forest (RF) [ 65 ] is a popular ML algorithm that belongs to the supervised learning technique. It is used in classification and regression problems.…”

Section: Materials and Methodsmentioning

confidence: 99%

Data-Driven Machine-Learning Methods for Diabetes Risk Prediction

Δρίτσας

Trigka

2022

Sensors

View full text Add to dashboard Cite

Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the incidence of diabetes. Therefore, early diagnosis of the disease is a necessity. Machine Learning (ML) has gained great popularity among healthcare providers and physicians due to its high potential in developing efficient tools for risk prediction, prognosis, treatment and the management of various conditions. In this study, a supervised learning methodology is described that aims to create risk prediction tools with high efficiency for type 2 diabetes occurrence. A features analysis is conducted to evaluate their importance and explore their association with diabetes. These features are the most common symptoms that often develop slowly with diabetes, and they are utilized to train and test several ML models. Various ML models are evaluated in terms of the Precision, Recall, F-Measure, Accuracy and AUC metrics and compared under 10-fold cross-validation and data splitting. Both validation methods highlighted Random Forest and K-NN as the best performing models in comparison to the other models.

show abstract

“…Random Forest (RF) [ 65 ] is a popular ML algorithm that belongs to the supervised learning technique. It is used in classification and regression problems.…”

Section: Materials and Methodsmentioning

confidence: 99%

Data-Driven Machine-Learning Methods for Diabetes Risk Prediction

Δρίτσας

Trigka

2022

Sensors

View full text Add to dashboard Cite

show abstract

“…The cumulative result of the trees provides a reasonable prediction. The model also identifies the most significant variables that explain the dependent variable, which frequently leads to improved performance (20,21). In this study, 100 trees were used in the RF model.…”

Section: Data Preprocessing and Development Of Predictive Modelsmentioning

confidence: 99%

Prediction of COVID-19 Based on Genomic Biomarkers of Metagenomic Next-Generation Sequencing Data Using Artificial Intelligence Technology

Akbulut¹

2022

Erciyes Med J

View full text Add to dashboard Cite

Objective:The primary aim of this study was to use metagenomic next-generation sequencing (mNGS) data to identify coronavirus 2019 (COVID-19)-related biomarker genes and to construct a machine learning model that could successfully differentiate patients with COVID-19 from healthy controls. Materials and Methods:The mNGS dataset used in the study demonstrated expression of 15,979 genes in the upper airway in 234 patients who were COVID-19 negative and COVID-19 positive. The Boruta method was used to select qualitative biomarker genes associated with COVID-19. Random forest (RF), gradient boosting tree (GBT), and multi-layer perceptron (MLP) models were used to predict COVID-19 based on the selected biomarker genes. Results:The MLP (0.936) model outperformed the GBT (0.851), and RF (0.809) models in predicting COVID-19. The three most important biomarker candidate genes associated with COVID-19 were IFI27, TPTI, and FAM83A. Conclusion:The proposed model (MLP) was able to predict COVID-19 successfully. The results showed that the generated model and selected biomarker candidate genes can be used as diagnostic models for clinical testing or potential therapeutic targets and vaccine design.

show abstract

“…In this process, we used the Random Forest algorithm to analyze this process and compare our results with the Decision Tree, Naive Bayes, Logistic Regression, and Support Vector Machine algorithms. The main reason for using Random Forest in this process is its good performance in terms of classification, as compared to the other algorithms [ 58 ]. Table 6 shows the details of each classifier’s performance for each fold.…”

Section: Experimental Results and Development Environmentmentioning

confidence: 99%

NLP-Based Digital Forensic Analysis for Online Social Network Based on System Security

Shahbazi

Byun

2022

IJERPH

View full text Add to dashboard Cite

Social media evidence is the new topic in digital forensics. If social media information is correctly explored, there will be significant support for investigating various offenses. Exploring social media information to give the government potential proof of a crime is not an easy task. Digital forensic investigation is based on natural language processing (NLP) techniques and the blockchain framework proposed in this process. The main reason for using NLP in this process is for data collection analysis, representations of every phase, vectorization phase, feature selection, and classifier evaluation. Applying a blockchain technique in this system secures the data information to avoid hacking and any network attack. The system’s potential is demonstrated by using a real-world dataset.

show abstract

Machine Learning Technique to Prognosis Diabetes Disease: Random Forest Classifier Approach

Cited by 126 publications

References 14 publications

Data-Driven Machine-Learning Methods for Diabetes Risk Prediction

Data-Driven Machine-Learning Methods for Diabetes Risk Prediction

Prediction of COVID-19 Based on Genomic Biomarkers of Metagenomic Next-Generation Sequencing Data Using Artificial Intelligence Technology

NLP-Based Digital Forensic Analysis for Online Social Network Based on System Security

Contact Info

Product

Resources

About