2022
DOI: 10.2196/39235
|View full text |Cite
|
Sign up to set email alerts
|

Issues With Variability in Electronic Health Record Data About Race and Ethnicity: Descriptive Analysis of the National COVID Cohort Collaborative Data Enclave

Abstract: Background The adverse impact of COVID-19 on marginalized and under-resourced communities of color has highlighted the need for accurate, comprehensive race and ethnicity data. However, a significant technical challenge related to integrating race and ethnicity data in large, consolidated databases is the lack of consistency in how data about race and ethnicity are collected and structured by health care organizations. Objective This study aims to evalu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(14 citation statements)
references
References 20 publications
0
14
0
Order By: Relevance
“…Harmonization of these disparate data thus requires assumptions and inferences to be made that could incur systematic biases. Similarly, the ability to accurately determine race within N3C is diminished by variations in how race is reported in different healthcare systems 30 Nonetheless, we highlight the meticulous efforts from the N3C collaborative in evaluating and improving the quality of phenotypes generated within 31 N3C. Further, missingness is a known issue with N3C’s vaccination data, as patients may receive vaccine doses at pop-up clinics, drugstores, or at their place of employment, which may not end up recorded in the patient’s records.…”
Section: Discussionmentioning
confidence: 99%
“…Harmonization of these disparate data thus requires assumptions and inferences to be made that could incur systematic biases. Similarly, the ability to accurately determine race within N3C is diminished by variations in how race is reported in different healthcare systems 30 Nonetheless, we highlight the meticulous efforts from the N3C collaborative in evaluating and improving the quality of phenotypes generated within 31 N3C. Further, missingness is a known issue with N3C’s vaccination data, as patients may receive vaccine doses at pop-up clinics, drugstores, or at their place of employment, which may not end up recorded in the patient’s records.…”
Section: Discussionmentioning
confidence: 99%
“…Race and ethnicity data are collected from patients by healthcare workers, using these standard categories as a guide (Cruz and Smith, 2021). Once stored within the EHR, this data may be exchanged and aggregated with data from other health systems (Cook et al, 2022). When this data is extracted for analysis, missing data points may be imputed, new categories may be created, and others may be dropped entirely.…”
Section: Data Workmentioning
confidence: 99%
“…A recent analysis in N3C reported race and ethnicity biases and how these affect the quality of the harmonized data within N3C. 29 Notably, the analysis emphasizes the importance of making transparent the process of data harmonization to enable informed decisions on generating race and ethnicity phenotypes. We note that that this transparency is built into the design of N3C and is accessible to users of the system.…”
Section: Strategy 2: Leveraging Big Data and Informatics To Advance D...mentioning
confidence: 99%
“…The N3C platform enables critical insights into the relative burden of COVID-19 infections, hospitalizations, mortality by race/ethnicity and gender, and other social determinants of health factors. A recent analysis of N3C reported race and ethnicity biases and how these affect the quality of the harmonized data within N3C [29]. Notably, the analysis emphasizes the importance of making transparent the process of data harmonization to enable informed decisions on generating race and ethnicity phenotypes.…”
Section: Strategy 2: Leveraging Big Data and Informatics To Advance D...mentioning
confidence: 99%