Biocomputing 2020 2019
DOI: 10.1142/9789811215636_0061
|View full text |Cite
|
Sign up to set email alerts
|

Robust-ODAL: Learning from heterogeneous health systems without sharing patient-level data

Abstract: Electronic Health Records (EHR) contain extensive patient data on various health outcomes and risk predictors, providing an efficient and wide-reaching source for health research. Integrated EHR data can provide a larger sample size of the population to improve estimation and prediction accuracy. To overcome the obstacle of sharing patient-level data, distributed algorithms were developed to conduct statistical analyses across multiple clinical sites through sharing only aggregated information. However, the he… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
4
1

Relationship

4
5

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 23 publications
0
11
0
Order By: Relevance
“…Secondly, ODAC is based on the fact that we treat the pooled analysis as the gold standard method, which essentially requires data to be homogenous distributed across sites. However, the effect sizes as well as the baseline hazard function at each site can be different in practice 31 . To efficiently synthesize evidence from heterogeneous clinical sites in large CRN, we are planning to extend the current distributed algorithms by allowing site-specific hazard function and effect sizes in the future.…”
Section: Discussionmentioning
confidence: 99%
“…Secondly, ODAC is based on the fact that we treat the pooled analysis as the gold standard method, which essentially requires data to be homogenous distributed across sites. However, the effect sizes as well as the baseline hazard function at each site can be different in practice 31 . To efficiently synthesize evidence from heterogeneous clinical sites in large CRN, we are planning to extend the current distributed algorithms by allowing site-specific hazard function and effect sizes in the future.…”
Section: Discussionmentioning
confidence: 99%
“…In addition, EHR co-occurrence statistics are often specific to a local medical institution due to differences in local practices and the difficulties in patient data sharing. Co-occurrence statistics should be calculated across multiple institutions to learn more efficient and generalizable MCEs while minimizing bias of EHR from each institution [36][37][38] . The OMOP CDM can be leveraged to accomplish this since it provides a mechanism to convert local data models to its common data model.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, it requires that the data are homogeneously distributed across sites. However, when there exists heterogeneity in the distribution of data across sites, the pooled analysis may not be a gold standard and will require correcting the model to address the heterogeneity 34 . Taking this into consideration, we plan to extend our method to handle heterogeneity across clinical sites by allowing site-specific effects and covariates.…”
Section: Conclusion and Discussionmentioning
confidence: 99%