2023
DOI: 10.1016/j.jbi.2022.104269
|View full text |Cite
|
Sign up to set email alerts
|

Mining for equitable health: Assessing the impact of missing data in electronic health records

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
28
0
2

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(30 citation statements)
references
References 20 publications
0
28
0
2
Order By: Relevance
“…This highlighted incomplete recording by clinical staff within the electronic patient record, indicating that other individuals meeting the study criteria may not have been identified. Other studies have found missing data in electronic patient records of individuals with intellectual disability, and missing data has been found to lead to poorer access to health care for those individuals (Getzen et al , 2023; Soper et al , 2022). A possible explanation for the lack of ICD 10 codes entered on the electronic patient record is that diagnostic codes are often recorded in psychiatry clinic letters, rather than officially uploaded to the electronic patient record as is the expected procedure.…”
Section: Discussionmentioning
confidence: 99%
“…This highlighted incomplete recording by clinical staff within the electronic patient record, indicating that other individuals meeting the study criteria may not have been identified. Other studies have found missing data in electronic patient records of individuals with intellectual disability, and missing data has been found to lead to poorer access to health care for those individuals (Getzen et al , 2023; Soper et al , 2022). A possible explanation for the lack of ICD 10 codes entered on the electronic patient record is that diagnostic codes are often recorded in psychiatry clinic letters, rather than officially uploaded to the electronic patient record as is the expected procedure.…”
Section: Discussionmentioning
confidence: 99%
“…A recent study using the Medical Information Mart for Intensive Care III (MIMIC-III) database demonstrates that the addition of missing data tends to more negatively impact the performance of disease prediction models for groups that have less access to healthcare [38]. Specifically, when stratified by insurance status, patients insured by Medicare/Medicaid experienced a sharper drop in model performance as the amount of missing data increased compared to those with private insurance [38].…”
Section: Phases Of Sample Selection and Evolutionmentioning
confidence: 99%
“…Accordingly, these patients' data may be spread across multiple health facilities, increasing the likelihood of missing observations in a single-center machine learning dataset. Individuals with low SES and racially minoritized individuals may also tend to use the healthcare system less often due to structural barriers resulting in less access, including receiving less testing, so studies examining an electronic health record would report more missing data for these patients [38].…”
Section: Input-level Exclusionmentioning
confidence: 99%
“…increased (2,3). In ICU cohort studies, missing data may introduce biased results as well as a loss of statistical power and precision (4).…”
mentioning
confidence: 99%
“…Using large cohorts in ICU research has become an increasingly common approach to answering clinically relevant questions (1). As the use of big data has increased, where data are often aggregated from multiple sources including electronic health records (EHRs), insurance claim databases, and government networks databases, concerns about data missingness have increased (2, 3). In ICU cohort studies, missing data may introduce biased results as well as a loss of statistical power and precision (4).…”
mentioning
confidence: 99%