2022
DOI: 10.48550/arxiv.2204.06604
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

EHRKit: A Python Natural Language Processing Toolkit for Electronic Health Record Texts

Abstract: The Electronic Health Record (EHR) is an essential part of the modern medical system and impacts healthcare delivery, operations, and research. Unstructured text is attracting much attention despite structured information in the EHRs and has become an exciting research field. The success of the recent neural Natural Language Processing (NLP) method has led to a new direction for processing unstructured clinical notes. In this work, we create a python library for clinical texts, EHRKit. This library contains tw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 20 publications
(23 reference statements)
0
1
0
Order By: Relevance
“…Most similar to FrESCO is PyHealth (Zhao et al, 2021) though it is more broadly scoped, focusing on MIMIC (Medical Information Mart for Intensive Care), electronic intensive care unit (eICU), and observational medical outcomes partnership common data model (OMOP-CDM) databases. Biomedical libraries such as Med7 (Kormilitzin et al, 2021) and EHRkit (Li et al, 2022) focus on electronic health records in general and machine learning tasks such as named-entity recognition and document summarization. Our FrESCO library is singularly focused on cancer pathology reports and provides the model building workflow for auto-coding SEER pathology reports, which is a fundamental requirement in a clinical deployment environment (Harris et al, 2022).…”
Section: State Of the Fieldmentioning
confidence: 99%
“…Most similar to FrESCO is PyHealth (Zhao et al, 2021) though it is more broadly scoped, focusing on MIMIC (Medical Information Mart for Intensive Care), electronic intensive care unit (eICU), and observational medical outcomes partnership common data model (OMOP-CDM) databases. Biomedical libraries such as Med7 (Kormilitzin et al, 2021) and EHRkit (Li et al, 2022) focus on electronic health records in general and machine learning tasks such as named-entity recognition and document summarization. Our FrESCO library is singularly focused on cancer pathology reports and provides the model building workflow for auto-coding SEER pathology reports, which is a fundamental requirement in a clinical deployment environment (Harris et al, 2022).…”
Section: State Of the Fieldmentioning
confidence: 99%