Gloria Lipori scite author profile

There is an increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is comparatively small at 110 million parameters (compared with billions of parameters in the general domain). It is not clear how large clinical language models with billions of parameters can help medical AI systems utilize unstructured EHRs. In this study, we develop from scratch a large clinical language model—GatorTron—using >90 billion words of text (including >82 billion words of de-identified clinical text) and systematically evaluate it on five clinical NLP tasks including clinical concept extraction, medical relation extraction, semantic textual similarity, natural language inference (NLI), and medical question answering (MQA). We examine how (1) scaling up the number of parameters and (2) scaling up the size of the training data could benefit these NLP tasks. GatorTron models scale up the clinical language model from 110 million to 8.9 billion parameters and improve five clinical NLP tasks (e.g., 9.6% and 9.5% improvement in accuracy for NLI and MQA), which can be applied to medical AI systems to improve healthcare delivery. The GatorTron models are publicly available at: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/clara/models/gatortron_og.


Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data

Bian, Lyu, Loiacono, et al. 2020


Objective: To synthesize data quality (DQ) dimensions and assessment methods of real-world data, especially electronic health records, through a systematic scoping review and to assess the practice of DQ assessment in the national Patient-centered Clinical Research Network (PCORnet). Materials and Methods: We started with 3 widely cited DQ publications (2 reviews, from Chan et al (2010) and Weiskopf et al (2013a), and 1 DQ framework from Kahn et al (2016)) and expanded our review systematically to cover relevant articles published up to February 2020. We extracted DQ dimensions and assessment methods from these studies, mapped their relationships, and organized a synthesized summary of existing DQ dimensions and assessment methods. We reviewed the data checks employed by the PCORnet and mapped them to the synthesized DQ dimensions and methods. Results: We analyzed a total of 3 reviews, 20 DQ frameworks, and 226 DQ studies and extracted 14 DQ dimensions and 10 assessment methods. We found that completeness, concordance, and correctness/accuracy were commonly assessed. Element presence, validity check, and conformance were commonly used DQ assessment methods and were the main focuses of the PCORnet data checks. Discussion: Definitions of DQ dimensions and methods were not consistent in the literature, and the DQ assessment practice was not evenly distributed (eg, usability and ease-of-use were rarely discussed). Challenges in DQ assessment exist, given the complex and heterogeneous nature of real-world data. Conclusion: The practice of DQ assessment is still limited in scope. Future work is warranted to generate understandable, executable, and reusable DQ measures.


School-Located Influenza Vaccination Reduces Community Risk for Influenza and Influenza-Like Illness Emergency Care Visits

et al. 2014


Background: School-located influenza vaccination (SLIV) programs can substantially enhance the sub-optimal coverage achieved under existing delivery strategies. Randomized SLIV trials have shown these programs reduce laboratory-confirmed influenza among both vaccinated and unvaccinated children. This work explores the effectiveness of a SLIV program in reducing the community risk of influenza and influenza-like illness (ILI) associated emergency care visits. Methods: For the 2011/12 and 2012/13 influenza seasons, we estimated age-group specific attack rates (AR) for ILI from routine surveillance and census data. Age-group specific SLIV program effectiveness was estimated as one minus the AR ratio for Alachua County versus two comparison regions: the 12 county region surrounding Alachua County, and all non-Alachua counties in Florida. Results: Vaccination of ∼50% of 5–17 year-olds in Alachua reduced their risk of ILI-associated visits, compared to the rest of Florida, by 79% (95% confidence interval: 70, 85) in 2011/12 and 71% (63, 77) in 2012/13. The greatest indirect effectiveness was observed among 0–4 year-olds, reducing AR by 89% (84, 93) in 2011/12 and 84% (79, 88) in 2012/13. Among all non-school age residents, the estimated indirect effectiveness was 60% (54, 65) and 36% (31, 41) for 2011/12 and 2012/13. The overall effectiveness among all age-groups was 65% (61, 70) and 46% (42, 50) for 2011/12 and 2012/13. Conclusion: Wider implementation of SLIV programs can significantly reduce the influenza-associated public health burden in communities.
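The effectiveness measure described in the Methods above (one minus the attack-rate ratio) can be sketched in a few lines. This is only an illustration: the function names and the visit and population counts are hypothetical and are not taken from the study, and the confidence-interval calculation is not covered.

```python
def attack_rate(visits: int, population: int) -> float:
    """ILI-associated visits per resident of an age group."""
    if population <= 0:
        raise ValueError("population must be positive")
    return visits / population


def effectiveness(ar_program: float, ar_comparison: float) -> float:
    """One minus the attack-rate ratio (program region vs comparison region)."""
    if ar_comparison <= 0:
        raise ValueError("comparison attack rate must be positive")
    return 1.0 - ar_program / ar_comparison


# Hypothetical counts for a single age group, for illustration only.
ar_program = attack_rate(visits=30, population=10_000)
ar_comparison = attack_rate(visits=100, population=10_000)
print(f"Estimated effectiveness: {effectiveness(ar_program, ar_comparison):.0%}")
```

A value near 1 means the program region saw far fewer visits per resident than the comparison region; a value of 0 means no difference.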




Gloria Lipori

MySurgeryRisk: Development and Validation of a Machine-learning Risk Algorithm for Major Complications and Death After Surgery

A large language model for electronic health records

