2021
DOI: 10.48550/arxiv.2103.00562
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

CREATe: Clinical Report Extraction and Annotation Technology

Abstract: Clinical case reports are written descriptions of the unique aspects of a particular clinical case, playing an essential role in sharing clinical experiences about atypical disease phenotypes and new therapies. However, to our knowledge, there has been no attempt to develop an end-to-end system to annotate, index, or otherwise curate these reports. In this paper, we propose a novel computational resource platform, CREATe, for extracting, indexing, and querying the contents of clinical case reports. CREATe fost… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 7 publications
0
2
0
Order By: Relevance
“…Domain-specific Pre-trained Language Model. To tackle domainspecific tasks, such as Clinical information extraction [82] and Bioinformatics knowledge acquisition [46], recent studies pre-train new language models with large-scale corpora collected from those domains [2,47] to learn customized token and sequence representations. Motivated by these approaches, we leverage all COVID-19 relevant text corpora together with the social media data to pretrain a CoronaBERT language model with 12 layers of Transformers and over 110 million parameters, in order to equip our models with powerful input embeddings.…”
Section: Constructing Dynamic Knowledge Graphs From Social Media Datamentioning
confidence: 99%
“…Domain-specific Pre-trained Language Model. To tackle domainspecific tasks, such as Clinical information extraction [82] and Bioinformatics knowledge acquisition [46], recent studies pre-train new language models with large-scale corpora collected from those domains [2,47] to learn customized token and sequence representations. Motivated by these approaches, we leverage all COVID-19 relevant text corpora together with the social media data to pretrain a CoronaBERT language model with 12 layers of Transformers and over 110 million parameters, in order to equip our models with powerful input embeddings.…”
Section: Constructing Dynamic Knowledge Graphs From Social Media Datamentioning
confidence: 99%
“…Clinical case reports (CCRs) are written descriptions of the unique aspects of a particular clinical case (Cabán-Martinez and García-Beltrán, 2012). They are intended to serve as educational aids to science and medicine, as they play an essential role in sharing clinical experiences about atypical disease phenotypes and new therapies (Caufield et al, 2018, Zhou et al, 2021a. Unlike other types of clinical documents (e.g., electronic medical records, or EMRs), CCRs generally describe single clinical narratives at a time: these are stories of diseases as they were observed and treated, written in language requiring domain familiarity but otherwise generally interpretable.…”
Section: Introductionmentioning
confidence: 99%