2019
DOI: 10.1101/587436
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

High-throughput Multimodal Automated Phenotyping (MAP) with Application to PheWAS

Abstract: Objective: Electronic health records (EHR) linked with biorepositories are a powerful platform for translational studies. A major bottleneck exists in the ability to phenotype patients accurately and efficiently. The objective of this study was to develop an automated high-throughput phenotyping method integrating International Classification of Diseases (ICD) codes and narrative data extracted using natural language processing (NLP). Method:We developed a mapping method for automatically identifying relevant … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 23 publications
(45 citation statements)
references
References 35 publications
0
45
0
Order By: Relevance
“…Using the PRISM features, we PRISM features, we were able to train supervised self-learning (SSL) and transfer learning (STL) classifiers that resulted in AUC ROC of 0.97, which can be compared to the specialized computational phenotyping performances between 0.94 and 0.96 in the literature. 22 , 31 , 32 …”
Section: Resultsmentioning
confidence: 99%
“…Using the PRISM features, we PRISM features, we were able to train supervised self-learning (SSL) and transfer learning (STL) classifiers that resulted in AUC ROC of 0.97, which can be compared to the specialized computational phenotyping performances between 0.94 and 0.96 in the literature. 22 , 31 , 32 …”
Section: Resultsmentioning
confidence: 99%
“…For common conditions such as diabetes mellitus, including the primary International Classification of Diseases, Ninth Revision or Tenth Revision (ICD‐9/10) billing code for the condition (e.g., 250.00 for diabetes mellitus without mention of complications) and primary NLP concept alone (e.g., “diabetes”) in an algorithm can achieve relatively high PPVs (13). However, for episodic or uncommon conditions that may be discussed at only a handful of visits, such as pseudogout, additional features related to the condition may be useful.…”
Section: Methodsmentioning
confidence: 99%
“…To improve upon methods that only consider codes, machine learning tools, largely based upon NLPs, have been developed to collect more phenotypic data from data sources beyond standardized codes such as textual clinical notes, textual discharge summaries and radiology reports [1,[18][19][20][21]. Liao et al developed a multimodal automated phenotyping (MAP) algorithm to leverage both ICD codes and EMR textual narratives based on the Unified Medical Language System [18]. MAP is multimodal because it can extract entities such as ICDs, medical NLP concepts and healthcare utilization information related to a certain phenotype from both codes and free text.…”
Section: Emrs and Phenotype-genotype Association Researchmentioning
confidence: 99%