2022
DOI: 10.1101/2022.04.23.22274218
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Machine Learning Approaches for Electronic Health Records Phenotyping: A Methodical Review

Abstract: ObjectiveAccurate and rapid methods for phenotyping are a prerequisite to realizing the potential of electronic health records (EHRs) data for clinical and translational research. This study reviews the literature on machine learning (ML) approaches for phenotyping with respect to the phenotypes considered, the data sources and methods used, and the contributions within the wider context of EHR-based research.Materials and MethodsWe searched for relevant articles in PubMed and Web of Science published between … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 179 publications
(589 reference statements)
0
4
0
Order By: Relevance
“…This can be especially useful in urosepsis, where imaging can be used to identify the source of infection, such as a urinary tract obstruction or a kidney abscess. 4,[11][12][13] .…”
Section: Introductionmentioning
confidence: 99%
“…This can be especially useful in urosepsis, where imaging can be used to identify the source of infection, such as a urinary tract obstruction or a kidney abscess. 4,[11][12][13] .…”
Section: Introductionmentioning
confidence: 99%
“…To address this gap, researchers have developed machine learning algorithms intended to classify adult patients with PASC in large clinical databases [ 5 ]. In contrast with explicit rule-based definitions, machine learning algorithms have the advantage of being able to detect complex patterns involving thousands of covariates [ 6 ].…”
Section: Introductionmentioning
confidence: 99%
“…Unstructured variables in EHRs are usually dealt with using manual abstraction and proxy variables available as structured data 5 . Manual abstraction is time-consuming and not scalable, especially with the increase in EHR data 6 .…”
Section: Introductionmentioning
confidence: 99%
“…false positive)? * 13 5. 10 20How many patient charts would you be willing to review manually to prevent one patient who has GBM from being left out of the dataset (i.e.…”
mentioning
confidence: 99%