2018
DOI: 10.1371/journal.pone.0192586
|View full text |Cite
|
Sign up to set email alerts
|

Towards phenotyping stroke: Leveraging data from a large-scale epidemiological study to detect stroke diagnosis

Abstract: Objective1) To develop a machine learning approach for detecting stroke cases and subtypes from hospitalization data, 2) to assess algorithm performance and predictors on real-world data collected by a large-scale epidemiology study in the US; and 3) to identify directions for future development of high-precision stroke phenotypic signatures.Materials and methodsWe utilized 8,131 hospitalization events (ICD-9 codes 430–438) collected from the Greater Cincinnati/Northern Kentucky Stroke Study in 2005 and 2010. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

3
37
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(40 citation statements)
references
References 64 publications
3
37
0
Order By: Relevance
“…In demonstrating that AIS patients can be recovered from other EHR-available structured clinical features without AIS codes, this approach is in contrast to previous machine learning phenotyping algorithms, which have relied on manually curated features or use AIS-related diagnosis codes as the sole nonzero features in their models. 23,24,3 Cases and controls for training of phenotyping algorithms can be challenging to identify and de ne given the richness of available EHR data. From the sparsity of diagnosis codes in the EHR, it follows that patients lacking an AIS-related diagnosis code may not always be considered as a control in stroke cohorts.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In demonstrating that AIS patients can be recovered from other EHR-available structured clinical features without AIS codes, this approach is in contrast to previous machine learning phenotyping algorithms, which have relied on manually curated features or use AIS-related diagnosis codes as the sole nonzero features in their models. 23,24,3 Cases and controls for training of phenotyping algorithms can be challenging to identify and de ne given the richness of available EHR data. From the sparsity of diagnosis codes in the EHR, it follows that patients lacking an AIS-related diagnosis code may not always be considered as a control in stroke cohorts.…”
Section: Discussionmentioning
confidence: 99%
“…[19][20][21][22] Stroke phenotyping algorithms have also used machine learning to enhance the classi cation performance of a diagnosis-code based AIS phenotyping algorithm. [23][24][25][26] However, while ML models present an opportunity to automate identi cation of AIS patients (i.e. phenotyping) with commonly accessible EHR data and develop new approaches to etiologic identi cation and subtyping, the optimal combination of cases and controls to train such models remains unclear.…”
Section: Introductionmentioning
confidence: 99%
“…In demonstrating that AIS patients can be recovered from other EHR-available structured clinical features without AIS codes, this approach is in contrast to previous machine learning phenotyping algorithms, which have relied on manually curated features or use AIS-related diagnosis codes as the sole nonzero features in their models. 15,16,3 Cases and controls for training of phenotyping algorithms can be challenging to identify and define given the richness of available EHR data. From the sparsity of diagnosis codes in the EHR, it follows that patients lacking an AIS-related diagnosis code may not always be considered as a control in stroke cohorts.…”
Section: Discussionmentioning
confidence: 99%
“…Third, because our model incorporated structured data from standard terminologies, they therefore may be generalizable to other health systems outside CUIMC, whereas recent studies have relied on manually curated feature sets. 15 Fourth, we examined several different combinations of cases, controls and classifiers for the purposes of training phenotyping models.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation