2021
DOI: 10.1101/2021.05.26.21257872
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Cohort Design and Natural Language Processing to Reduce Bias in Electronic Health Records Research: The Community Care Cohort Project

Abstract: Background: Electronic health records (EHRs) promise to enable broad-ranging discovery with power exceeding that of conventional research cohort studies. However, research using EHR datasets may be subject to selection bias, which can be compounded by missing data, limiting the generalizability of derived insights. Methods: Mass General Brigham (MGB) is a large New England-based healthcare network comprising seven tertiary care and community hospitals with associated outpatient practices. Within an MGB-based … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
1

Relationship

5
2

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 39 publications
0
9
0
Order By: Relevance
“…13,14,17 Baseline age, sex, race, height, weight, and blood pressure values were obtained from the EHR. 18 Antihypertensive use was determined using medication lists. 14 Tobacco use was categorized as present or absent.…”
Section: Clinical Factorsmentioning
confidence: 99%
“…13,14,17 Baseline age, sex, race, height, weight, and blood pressure values were obtained from the EHR. 18 Antihypertensive use was determined using medication lists. 14 Tobacco use was categorized as present or absent.…”
Section: Clinical Factorsmentioning
confidence: 99%
“…MGB is a large healthcare network serving the New England region of the US. We utilized the Community Care Cohort Project 24 , an EHR dataset comprising over 520,000 individuals who received longitudinal primary within the MGB system, which includes 7 academic and community hospitals with associated outpatient clinics.…”
Section: Methodsmentioning
confidence: 99%
“…7 Briefly, we trained a convolutional neural network (CNN) to predict 5-year risk of AF using 12-lead ECGs in a longitudinal patient cohort derived from the MGB network. 11 ECG-AI uses an encoding and loss function that takes into account both time to event (i.e., AF) and missingness introduced by censoring (death or loss of follow-up) to estimate a 5-year survival probability of AF. The predicted AF risk was then calculated as one minus the survival directly outputted from the ECG-AI model.…”
Section: Predicted Risk Of Afmentioning
confidence: 99%