2015
DOI: 10.1093/jamia/ocv130
|View full text |Cite
|
Sign up to set email alerts
|

Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance

Abstract: Multiple EHR components provide a more consistent and higher performance than a single one for the selected phenotypes. We suggest considering multiple EHR components for future phenotyping design in order to obtain an ideal result.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

4
169
2

Year Published

2016
2016
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 169 publications
(175 citation statements)
references
References 42 publications
4
169
2
Order By: Relevance
“…Furthermore, structured data elements, such as International Classification of Diseases billing codes, can still be subject to high error rates and are often not sufficient for phenotyping activities. 22 …”
Section: Discussionmentioning
confidence: 99%
“…Furthermore, structured data elements, such as International Classification of Diseases billing codes, can still be subject to high error rates and are often not sufficient for phenotyping activities. 22 …”
Section: Discussionmentioning
confidence: 99%
“…19,20 Each data source poses unique challenges, and use of multiple data sources often improves performance. 21 Billing code-based phenotyping methods have variable performance with estimates for cardiovascular and stroke risk factors ranging from 0.55 to 0.95 positive predictive value (PPV). 22 Similarly, various phenotyping studies have used natural language processing (NLP)-extracted concepts alone, with sensitivities ranging from 72% to 99.6% and PPV between 63% and 100%.…”
Section: Background and Significancementioning
confidence: 99%
“…The time period is three months (90 days) before the occurrence of the target ADE event, i.e., up to but not including the time point when the target ADE has been assigned. 4 Basic descriptions of each dataset, including class label, the number of positive and negative examples, and the number of involved clinical measurements are presented in Table 1.…”
Section: Data Sourcementioning
confidence: 99%
“…To tackle the integration problem, studies have been conducted on the knowledge level and the data level, respectively. Some rely on domain knowledge to extract a joint patient cohort by dening criteria from dierent data types [4], while others explore the possibility of integrating heterogeneous EHR data prior to or post modeling [5,6,7,8]. The focus of this study is on the latter: analyzing complex longitudinal data.…”
Section: Introductionmentioning
confidence: 99%