2021
DOI: 10.1371/journal.pone.0250832
|View full text |Cite
|
Sign up to set email alerts
|

Nutritional markers of undiagnosed type 2 diabetes in adults: Findings of a machine learning analysis with external validation and benchmarking

Abstract: Objectives Using a nationally-representative, cross-sectional cohort, we examined nutritional markers of undiagnosed type 2 diabetes in adults via machine learning. Methods A total of 16429 men and non-pregnant women ≥ 20 years of age were analysed from five consecutive cycles of the National Health and Nutrition Examination Survey. Cohorts from years 2013–2016 (n = 6673) was used for external validation. Undiagnosed type 2 diabetes was determined by a negative response to the question “Have you ever been to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 61 publications
0
6
0
2
Order By: Relevance
“…But they cannot interpret the large amount of data collected during the formation of personal dietary recommendations, when analyzing the relation between intake of nutrients and the health of a consumer. Therefore it is necessary to supplement traditional statistical methods with machine-aided processing of data [44,45]. Machine-aided processing of datarefers to a computer system capable of describing the solution to a given problem and creating an algorithm based on this solution.…”
Section: Resultsmentioning
confidence: 99%
“…But they cannot interpret the large amount of data collected during the formation of personal dietary recommendations, when analyzing the relation between intake of nutrients and the health of a consumer. Therefore it is necessary to supplement traditional statistical methods with machine-aided processing of data [44,45]. Machine-aided processing of datarefers to a computer system capable of describing the solution to a given problem and creating an algorithm based on this solution.…”
Section: Resultsmentioning
confidence: 99%
“…The insufficiency of features might also be one of the reasons why our XGBoost model did not perform as well as in other research [ 28 ], whose model included 300 features. After all, in addition to demographic and lifestyle, nutrition intake has also been found to be an important predictor of incident diabetes [ 29 ]. However, high-dimensional features generally bring about information redundancy and overfitting problem.…”
Section: Discussionmentioning
confidence: 99%
“…To determine the undiagnosed T2DM in adults, the nutritional markers were found by ANN, LR, and RF models [ 55 ]. To overcome the impact of the class imbalance, resampling algorithms containing Random Oversampling Examples (ROSE), minority class over-sampling, and Synthetic Minority Oversampling Technique (SMOTE) were applied.…”
Section: The Application Of ML and Dl Models For The Management Predi...mentioning
confidence: 99%
“…The publicly available dataset are mentioned in bold Sample number ML models Refs. Early diagnosis and prediction of diabetes T2DM 15,005 subjects with age ≥ 3 XGBoost, DNN, and RF [ 63 ] 1512 subjects LR, RF, Naive Bayes (NB), SVM, XGBT, ANN, K-nearest neighbor (KNN), DT, XceptionResNet 50, DenseNet121, Vgg16, Vgg19, and InceptionV3, Stacking model of non-invasive variables and the Resnet50 model [ 53 ] 530 participants: 272 were diabetic patients and 258 were non-diabetic patients Deep autoencoder learning algorithm with CNN networks and deep radial basis function neural network (RBFNN) classifier [ 52 ] 217 participants with diabetes, prediabetes and normal conditions SVM, K-nearest neighbors, RF, XGBoost, hybrid feature selection-XGBoost [ 91 ] 2371 T1-weighted whole-body MRI data sets DenseNet architecture [ 54 ] 8454 subjects over five years of follow- up XGBoost, SVM, LR, RF, and ensemble algorithms [ 64 ] 16,429 men and non-pregnant women ≥ 20 years of age ANN, LR, and RF models [ 55 ] 453,487 T2DM patients Reverse engineering and forward simulation (REFS) [ 124 ] 82 obese women (40 non-diabetic and 42 diabetes) Separability-correlation measure (SCM) and ANN [ 57 ] 13,309 Canadian patients GBM and LR [ 92 ] Kaggle diabetes dataset RF ...…”
Section: Introductionmentioning
confidence: 99%