Zitao Shen scite author profile

Background: Since no effective therapies exist for Alzheimer’s disease (AD), prevention through lifestyle changes and interventions has become more critical. Analyzing electronic health records (EHRs) of patients with AD can help us better understand the effect of lifestyle on AD. However, lifestyle information is typically stored in clinical narratives. Thus, the objective of the study was to compare different natural language processing (NLP) models on classifying lifestyle statuses (e.g., physical activity and excessive diet) from clinical texts in English.

Methods: Based on the collected concept unique identifiers (CUIs) associated with lifestyle status, we extracted all related EHRs for patients with AD from the Clinical Data Repository (CDR) of the University of Minnesota (UMN). We automatically generated labels for the training data by using a rule-based NLP algorithm. We conducted weak supervision for pre-trained Bidirectional Encoder Representations from Transformers (BERT) models, with three traditional machine learning models as baselines, on the weakly labeled training corpus. These models include the BERT base model, PubMedBERT (abstracts + full text), PubMedBERT (only abstracts), Unified Medical Language System (UMLS) BERT, BioBERT, Bio-clinical BERT, logistic regression, support vector machine, and random forest. We performed two case studies, physical activity and excessive diet, to validate the effectiveness of BERT models in classifying lifestyle status. All models were evaluated and compared on the developed Gold Standard Corpus (GSC) for both case studies. The rule-based model used for weak supervision was also tested on the GSC for comparison.

Results: The UMLS BERT model achieved the best performance for classifying physical activity status, with precision, recall, and F1 scores of 0.93, 0.93, and 0.92, respectively. For classifying excessive diet, the Bio-clinical BERT model showed the best performance, with precision, recall, and F1 scores of 0.93, 0.93, and 0.93, respectively.

Conclusion: The proposed approach leveraging weak supervision could significantly increase the sample size, which is required for training deep learning models. Compared with the traditional machine learning models, the BERT models also demonstrated high performance in classifying lifestyle status for Alzheimer’s disease in clinical notes.
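The weak-supervision idea in the Methods can be illustrated with a small sketch: a rule-based labeler assigns noisy labels to unlabeled sentences, and any model is later scored against a gold standard with precision, recall, and F1. The keyword patterns, label names, and function names below are hypothetical, chosen only for illustration. The abstract does not describe the study's actual rules, and this sketch does not reproduce its BERT training.

```python
import re
from typing import List, Optional, Sequence, Tuple

# Hypothetical keyword rules for the physical-activity case study.
# These are illustrative assumptions, not the rules used in the study.
ACTIVITY_PATTERNS = [r"\bexercis\w*", r"\bwalks?\b", r"\bjogs?\b"]
NEGATION_PATTERNS = [r"\bdoes not\b", r"\bdenies\b", r"\bunable to\b"]


def weak_label(sentence: str) -> Optional[str]:
    """Return a noisy label for a sentence, or None if no rule fires."""
    text = sentence.lower()
    if not any(re.search(p, text) for p in ACTIVITY_PATTERNS):
        return None  # no activity mention, so the sentence stays unlabeled
    if any(re.search(p, text) for p in NEGATION_PATTERNS):
        return "inactive"
    return "active"


def build_weak_corpus(sentences: Sequence[str]) -> List[Tuple[str, str]]:
    """Keep only sentences that received a weak label."""
    labeled = []
    for sentence in sentences:
        label = weak_label(sentence)
        if label is not None:
            labeled.append((sentence, label))
    return labeled


def precision_recall_f1(
    gold: Sequence[str], predicted: Sequence[str], positive: str
) -> Tuple[float, float, float]:
    """Score predictions for one class against gold-standard labels."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Invented toy sentences, not data from the study.
toy_notes = [
    "Patient walks 30 minutes daily.",
    "Patient denies any regular exercise.",
    "Blood pressure is stable.",
]
weak_corpus = build_weak_corpus(toy_notes)
```

In a pipeline like the one described, `weak_corpus` would serve as training data for a classifier, and `precision_recall_f1` would be applied to that classifier's predictions on a separately annotated gold standard rather than on the weak labels themselves.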


Virtual Reality System for Invasive Therapy

Kong, Wang, Shen. 2021


The moderating effect of employment burnout between involution behavior and employment preparation behavior of Chinese university students

Shen, SUN. 2023. Korean Assoc Learner-Centered Curric Instr


Objectives: This study examined the moderating effect of employment burnout on the relationship between involution behavior and employment preparation behavior.

Methods: A total of 282 Chinese university students were surveyed on involution behavior, employment preparation behavior, and employment burnout. Data from 271 respondents were analyzed using SPSS Statistics 26.0. Correlation analysis and hierarchical regression analysis were conducted to examine the moderating effect of employment burnout in the relationship between involution behavior and employment preparation behavior.

Results: Correlation analysis showed that involution behavior had a significant positive correlation with both employment burnout and employment preparation behavior. Hierarchical regression analysis showed that employment burnout moderated the effect of involution behavior on employment preparation behavior. Among the sub-factors of employment burnout, the moderating effects of exhaustion and dehumanization were not significant, whereas the moderating effects of antipathy, inability, and negative beliefs were confirmed.

Conclusions: The results of this study can serve as basic data for understanding not only the relationship among the three variables but also the moderating effects of overall employment burnout and of each of its sub-factors.


Natural Language Processing Methods to Extract Lifestyle Exposures for Alzheimer’s Disease from Clinical Notes

Yi, Shen, Bompelli, et al. 2020




Zitao Shen

Extracting Lifestyle Factors for Alzheimer's Disease from Clinical Notes Using Deep Learning with Weak Supervision

Classifying the lifestyle status for Alzheimer’s disease from clinical notes using deep learning with weak supervision

