2019
DOI: 10.1093/jamia/ocz041
|View full text |Cite
|
Sign up to set email alerts
|

Integration of genetic and clinical information to improve imputation of data missing from electronic health records

Abstract: Objective Clinical data of patients’ measurements and treatment history stored in electronic health record (EHR) systems are starting to be mined for better treatment options and disease associations. A primary challenge associated with utilizing EHR data is the considerable amount of missing data. Failure to address this issue can introduce significant bias in EHR-based research. Currently, imputation methods rely on correlations among the structured phenotype variables in the EHR. However, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(18 citation statements)
references
References 36 publications
0
18
0
Order By: Relevance
“…≥n visits required to be eligible into cohort); however, this may exclude a substantial amount of patients with certain characteristics, incurring a selection bias or limiting the generalizability of study findings (see Challenge #1 ). Other strategies attempt to account for missing visit biases through longitudinal imputation approaches; for example, if a patient missed a visit, a disease activity score can be imputed for that point in time, given other data points [ 73 , 74 ]. Surrogate measures may also be used to infer patient outcomes, such as controlling for “informative” missingness as an indicator variable or using actual number of missed visits that were scheduled as a proxy for external circumstances influencing care [ 20 ].…”
Section: Challenge #4: Missing Visitsmentioning
confidence: 99%
“…≥n visits required to be eligible into cohort); however, this may exclude a substantial amount of patients with certain characteristics, incurring a selection bias or limiting the generalizability of study findings (see Challenge #1 ). Other strategies attempt to account for missing visit biases through longitudinal imputation approaches; for example, if a patient missed a visit, a disease activity score can be imputed for that point in time, given other data points [ 73 , 74 ]. Surrogate measures may also be used to infer patient outcomes, such as controlling for “informative” missingness as an indicator variable or using actual number of missed visits that were scheduled as a proxy for external circumstances influencing care [ 20 ].…”
Section: Challenge #4: Missing Visitsmentioning
confidence: 99%
“…In MOMDR, the Pareto cross-validation consistency (CVC) operation is used to determine the multiple solutions depending on the most instances of solutions in K Pareto sets. MOMDR process comprises these stages: (1) The dataset is divided into K subsets for the CV calculation and generate K Pareto sets. (2) All feasible m-locus combinations are generated, and data reduction technique is used to divide multilocus genotypes as high-and low-risk groups.…”
Section: B Momdrmentioning
confidence: 99%
“…Genome-wide association studies (GWASs) have demonstrated that multilocus single-nucleotide polymorphisms (SNPs) influence some diseases [1]- [5]. SNP-SNP interactions might be involved in some complex traits of these diseases [6]- [8], and the determination of SNP-SNP interactions could resolve missing heritability concerns [9].…”
Section: Introductionmentioning
confidence: 99%
“…For example, the missingness level for very important variables, such as hemoglobin A1C or HbA1c (LOINC ID: 17856-6) levels, a common biomarker for diabetes can easily reach 50% or more in many realistic large datasets. At last, in a more recent study, the integration of genetic and clinical information was shown to improve the imputation of data missing from the Electronic Health Records [ 13 ]; however, genetic data integrated with the EHR is still scarce.…”
Section: Introductionmentioning
confidence: 99%