2021
DOI: 10.2196/22219
|View full text |Cite
|
Sign up to set email alerts
|

What Every Reader Should Know About Studies Using Electronic Health Record Data but May Be Afraid to Ask

Abstract: Coincident with the tsunami of COVID-19–related publications, there has been a surge of studies using real-world data, including those obtained from the electronic health record (EHR). Unfortunately, several of these high-profile publications were retracted because of concerns regarding the soundness and quality of the studies and the EHR data they purported to analyze. These retractions highlight that although a small community of EHR informatics experts can readily identify strengths and flaws in EHR-derived… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
85
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 90 publications
(85 citation statements)
references
References 26 publications
0
85
0
Order By: Relevance
“…25,38 Further, evaluating data provenance enables the reporting of data characteristics in EHR centric studies through presenting data completeness, data collection and handling, and the types of data. 36 The DQ failures identified in our data were of the typology well represented in the literature. [13][14][15][16][17][18][19][20] Missingness is a common challenge in the secondary use of EHR, and its causes include data being digitized after start of study, 14 and recording in free form clinical notes.…”
Section: Discussionmentioning
confidence: 55%
See 1 more Smart Citation
“…25,38 Further, evaluating data provenance enables the reporting of data characteristics in EHR centric studies through presenting data completeness, data collection and handling, and the types of data. 36 The DQ failures identified in our data were of the typology well represented in the literature. [13][14][15][16][17][18][19][20] Missingness is a common challenge in the secondary use of EHR, and its causes include data being digitized after start of study, 14 and recording in free form clinical notes.…”
Section: Discussionmentioning
confidence: 55%
“…The questions were guided by the why, how, and who of data recording, to interpret the data in the context of its generation, and subsequent lifecycle. Data provenance has been highlighted as an important component of EHR secondary use, [35][36][37] and other studies have documented the consideration of information flow as necessary for robust secondary use and avoiding biases. 25,38 Further, evaluating data provenance enables the reporting of data characteristics in EHR centric studies through presenting data completeness, data collection and handling, and the types of data.…”
Section: Discussionmentioning
confidence: 99%
“…To move from heterogenous and proprietary EHR data to OMOP that aligns to the FAIR principles [5], we developed a target oriented concept based on medical expertise and an EHR as well as an OMOP analysis (Figure 1). Working with EHR data in research requires a deep understanding of the original data (e.g data origin, data completeness, data correctness, data structure) and the given target environment for research [2]. Thus we built a multidisciplinary team of data and computer scientists as well as pharmacists.…”
Section: Methodsmentioning
confidence: 99%
“…The pandemic of coronavirus disease 2019 (COVID-19) has shown the need of standardized and reproducible research data, especially regarding drug administration, as observational studies are important to gain evidence, learn on real-word data and improve the COVID-19 patient treatment and their effects in the future [1]. However, those studies highly depend on the level of data quality, interoperability and reproducibility, even more if they are proceeded in a multi-centric environment [2].…”
Section: Introductionmentioning
confidence: 99%
“…High quality data sets are key assets for predicting prognosis and drug response from phenotypic, genotypic, and epigenetic data through innovative clinical trials and large-scale observational studies. However, in the rapidly growing field of real-world evidence generation, it is crucial to more critically evaluate EHR-driven studies the veracity of the data used to support the conclusions in order to prevent harm from misleading studies [15,16]. The accuracy of electronic health record data matters more than ever, especially due to the proliferation of clinical decision support, workflow systems and learning health systems.…”
Section: Ethical Legal Social Policy Issues and Solutions Stakeholder Participation And Research Networkmentioning
confidence: 99%