2015
DOI: 10.1016/j.jbi.2015.09.018
|View full text |Cite
|
Sign up to set email alerts
|

Creation of a new longitudinal corpus of clinical narratives

Abstract: The 2014 i2b2/UTHealth Natural Language Processing (NLP) shared task featured a new longitudinal corpus of 1,304 records representing 296 diabetic patients. The corpus contains three cohorts: patients who have a diagnosis of coronary artery disease (CAD) in their first record, and continue to have it in subsequent records; patients who do not have a diagnosis of CAD in the first record, but develop it by the last record; patients who do not have a diagnosis of CAD in any record. This paper details the process … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
23
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
9

Relationship

4
5

Authors

Journals

citations
Cited by 29 publications
(23 citation statements)
references
References 30 publications
0
23
0
Order By: Relevance
“…The 2014 corpus consists of “a mixture of discharge summaries and correspondences between medical professionals” (Kumar et al 2015) and included an expanded list of PHI (see Section 4.1 for the full list). Ten teams submitted 22 system runs to this shared task.…”
Section: Related Workmentioning
confidence: 99%
“…The 2014 corpus consists of “a mixture of discharge summaries and correspondences between medical professionals” (Kumar et al 2015) and included an expanded list of PHI (see Section 4.1 for the full list). Ten teams submitted 22 system runs to this shared task.…”
Section: Related Workmentioning
confidence: 99%
“…All the records for a single patient are either in the training or the testing set, and each of the three cohorts are represented equally in the training and testing data. A full description of the data and the corpus selection process can be found in Kumar et al (this issue).…”
Section: Datamentioning
confidence: 99%
“…The criteria for classifying CAD and no-CAD patients in our study has been defined and validated in two earlier studies [31],[3] . To create the corpus for the 2014 i2b2 Heart Disease Risk Factors Challenge, an expert cardiologist developed the definition for CAD.…”
Section: Datamentioning
confidence: 99%