2020
DOI: 10.1186/s13040-020-00230-x
|View full text |Cite
|
Sign up to set email alerts
|

Comparative analysis, applications, and interpretation of electronic health record-based stroke phenotyping methods

Abstract: Background Accurate identification of acute ischemic stroke (AIS) patient cohorts is essential for a wide range of clinical investigations. Automated phenotyping methods that leverage electronic health records (EHRs) represent a fundamentally new approach cohort identification without current laborious and ungeneralizable generation of phenotyping algorithms. We systematically compared and evaluated the ability of machine learning algorithms and case-control combinations to phenotype acute ischemic stroke pati… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(11 citation statements)
references
References 40 publications
0
11
0
Order By: Relevance
“…For example, 148 individuals with diabetes report HbA1c levels less than 35 mmol/mol, even though 48 mmol/mol is traditionally the diagnosis cut-off [59]. Several recent investigations have proven the practicality of using a wide range of diverse features to form accurate disease predictions that are robust to any single error in the electronic health records [28, 29]. However, these predictions are still far from perfect.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, 148 individuals with diabetes report HbA1c levels less than 35 mmol/mol, even though 48 mmol/mol is traditionally the diagnosis cut-off [59]. Several recent investigations have proven the practicality of using a wide range of diverse features to form accurate disease predictions that are robust to any single error in the electronic health records [28, 29]. However, these predictions are still far from perfect.…”
Section: Discussionmentioning
confidence: 99%
“…If a prediction derived from the patient data confidently disagrees with the disease label derived from electronic health records, then the corresponding suspect individual could be removed from the analysis. Previous studies have demonstrated with evidence the accuracy of disease predictions derived from large datasets of patient data [28, 29], but to the best of our knowledge they have not assessed whether using disease predictions instead of direct disease labels effects the predictions of polygenic risk score models.…”
Section: Introductionmentioning
confidence: 99%
“…With the broad adoption of EHR systems and the collection of large amount of EHR data, EHR data mining has become a hot research topic. Typical tasks of EHR data mining include medical concept learning, 32 patient subtyping, 33 phenotyping, 34 disease progression analysis, 35 and outcome prediction. Outcome prediction, which aims to predict the future health state of a patient, is one of the most important tasks in EHR data mining.…”
Section: Related Workmentioning
confidence: 99%
“…[7] Due to the high-dimensional, messy, noisy data that constitute EHRs, many studies have developed ensemble or deep learning methods to train accurate models. [1017] Algorithms employed in these studies (e.g. random forests and neural networks) generally can perform well in classification but are often limited in their interpretability , a subjective concept defined as the extent to which a model can be understood and/or its behavior interpreted by a user.…”
Section: Introductionmentioning
confidence: 99%