Somya D. Mohanty scite author profile

Background

Diabetes and cardiovascular disease are two of the main causes of death in the United States. Identifying and predicting these diseases in patients is the first step towards stopping their progression. We evaluate the capabilities of machine learning models in detecting at-risk patients using survey data (and laboratory results), and identify key variables within the data contributing to these diseases among the patients.

Methods

Our research explores data-driven approaches which utilize supervised machine learning models to identify patients with such diseases. Using the National Health and Nutrition Examination Survey (NHANES) dataset, we conduct an exhaustive search of all available feature variables within the data to develop models for cardiovascular, prediabetes, and diabetes detection. Using different time-frames and feature sets for the data (based on laboratory data), multiple machine learning models (logistic regression, support vector machines, random forest, and gradient boosting) were evaluated on their classification performance. The models were then combined to develop a weighted ensemble model, capable of leveraging the performance of the disparate models to improve detection accuracy. Information gain of tree-based models was used to identify the key variables within the patient data that contributed to the detection of at-risk patients in each of the disease classes by the data-learned models.

Results

The developed ensemble model for cardiovascular disease (based on 131 variables) achieved an Area Under - Receiver Operating Characteristics (AU-ROC) score of 83.1% using no laboratory results, and 83.9% accuracy with laboratory results. In diabetes classification (based on 123 variables), the eXtreme Gradient Boost (XGBoost) model achieved an AU-ROC score of 86.2% (without laboratory data) and 95.7% (with laboratory data). For pre-diabetic patients, the ensemble model had the top AU-ROC score of 73.7% (without laboratory data), and for laboratory-based data XGBoost performed the best at 84.4%. The top five predictors in diabetes patients were 1) waist size, 2) age, 3) self-reported weight, 4) leg length, and 5) sodium intake. For cardiovascular diseases the models identified 1) age, 2) systolic blood pressure, 3) self-reported weight, 4) occurrence of chest pain, and 5) diastolic blood pressure as key contributors.

Conclusion

We conclude that machine-learned models based on survey questionnaires can provide an automated identification mechanism for patients at risk of diabetes and cardiovascular diseases. We also identify key contributors to the prediction, which can be further explored for their implications on electronic health records.
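The abstract does not say how the ensemble weights were chosen or how the scores were computed, so the following is only a minimal sketch of the general idea: blend each model's predicted probabilities with a weighted average, then score the result by AU-ROC. The function names, the equal weights, and the toy labels and probabilities are all hypothetical and are not taken from the paper or from NHANES.

```python
from typing import Sequence


def weighted_ensemble(prob_sets: Sequence[Sequence[float]],
                      weights: Sequence[float]) -> list[float]:
    """Blend per-model probabilities with a normalized weighted average."""
    if len(prob_sets) != len(weights):
        raise ValueError("need one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n_samples = len(prob_sets[0])
    return [
        sum(w * probs[i] for w, probs in zip(weights, prob_sets)) / total
        for i in range(n_samples)
    ]


def auc_roc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AU-ROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > q else 0.5 if p == q else 0.0
        for p in pos
        for q in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = at-risk, 0 = not at-risk.
labels = [1, 0, 1, 0, 1, 0]
model_a = [0.9, 0.4, 0.3, 0.2, 0.8, 0.6]  # e.g. output of one classifier
model_b = [0.6, 0.5, 0.7, 0.1, 0.4, 0.3]  # e.g. output of another classifier

# Equal weights are an assumption made purely for illustration.
blended = weighted_ensemble([model_a, model_b], weights=[0.5, 0.5])

for name, scores in [("model_a", model_a), ("model_b", model_b), ("ensemble", blended)]:
    print(f"{name}: AU-ROC = {auc_roc(labels, scores):.3f}")
```

On this toy data the two models make different ranking mistakes, so the blended scores rank the samples better than either model alone. In practice the weights would be tuned on held-out data rather than fixed in advance.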

Understanding communication dynamics on Twitter during natural disasters: A case study of Hurricane Sandy

Pourebrahim

Sultana

Edwards

et al. 2019

International Journal of Disaster Risk Reduction

161

A sentiment analysis of U.S. local government tweets: The connection between tone and citizen involvement

Zavattaro

French

Mohanty

2015

Government Information Quarterly

144

Machine learning for predicting readmission risk among the frail: Explainable AI for healthcare

et al. 2022

Comparison of a Frailty Risk Score and Comorbidity Indices for Hospital Readmission Using Electronic Health Record Data

Lekan¹,

McCoy²,

Jenkins³

et al. 2021

Research in Gerontological Nursing

The purpose of the current study was to investigate the predictive properties of five definitions of a frailty risk score (FRS) and three comorbidity indices using data from electronic health records (EHRs) of hospitalized adults aged ≥50 years for 3-day, 7-day, and 30-day readmission, and to identify an optimal model for a FRS and comorbidity combination. Retrospective analysis of the EHR dataset was performed, and multivariable logistic regression and area under the curve (AUC) were used to examine readmission for frailty and comorbidity. The sample (N = 55,778) was mostly female (53%), non-Hispanic White (73%), married (53%), and on Medicare (55%). Mean FRSs ranged from 1.3 (SD = 1.5) to 4.3 (SD = 2.1). FRS and comorbidity were independently associated with readmission. Predictive accuracy for FRS and comorbidity combinations ranged from AUC of 0.75 to 0.77 (30-day readmission) to 0.84 to 0.85 (3-day readmission). FRS and comorbidity combinations performed similarly well, whereas comorbidity was always independently associated with readmission. FRS measures were more associated with 30-day readmission than 7-day and 3-day readmission. [Research in Gerontological Nursing, 14(2), 91–103.]


Somya D. Mohanty

A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
