Biocomputing 2019 2018
DOI: 10.1142/9789813279827_0004
|View full text |Cite
|
Sign up to set email alerts
|

ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites

Abstract: Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sample size in a more general population which potentially reduces clinical bias and improves estimation and prediction accuracy. To overcome the barrier of patient-level data sharing, distributed algorithms are develope… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
19
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 26 publications
(19 citation statements)
references
References 25 publications
0
19
0
Order By: Relevance
“…However, a challenge for such data integration is that the identification of which sites belonging to the majority of sites is unknown. To handle this type of heterogeneity and keep the algorithm to be entirely data-driven, we propose the following distributed algorithm by strengthening the algorithm in Duan et al (2019) [20].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, a challenge for such data integration is that the identification of which sites belonging to the majority of sites is unknown. To handle this type of heterogeneity and keep the algorithm to be entirely data-driven, we propose the following distributed algorithm by strengthening the algorithm in Duan et al (2019) [20].…”
Section: Methodsmentioning
confidence: 99%
“…To this end, in this paper, we attempt to develop a simple yet effective privacy-preserving distributed algorithm for fitting logistic regression within heterogeneous health systems without sharing patient-level data. The key idea is to modify the ODAL algorithm [20] by communicating robust summary statistics that are less sensitive to the existence of “outlying studies”. Through simulation studies and real data analysis using databases from the Janssen Research, we found that our new algorithm, which we refer to as the “robust-ODAL” method, is substantially more robust to the outlying studies and produces less biased estimates than the current ODAL method and traditional meta-analysis method.…”
Section: Introductionmentioning
confidence: 99%
“…The stateof-the-art method for multicenter study, without sharing patient-level information, is the meta-analysis, where each site fits separate analysis and all local estimates are synthesized using a weighted average 15 . In addition to metaanalysis, several distributed learning algorithms, in which only aggregate information is allowed to be shared across institutions, have been developed to overcome the privacy issue in a clinical research network [16][17][18][19] . Among the existing methods, Duan et al 20 .…”
Section: Introductionmentioning
confidence: 99%
“…In multicenter studies, sharing data is a major challenge due to privacy concerns 16 . To circumvent the issue of sharing individual patient-level data, many distributed algorithms have been developed to jointly study multiple datasets by communicating only summary-level statistics [17][18][19][20] . Among them, Duan et al [19][20] proposed a privacy-preserving Oneshot Distributed Algorithm for Logistic regression (ODAL), which can be used to identify risk factors of a binary healthcare outcome of interest.…”
Section: Introductionmentioning
confidence: 99%
“…To circumvent the issue of sharing individual patient-level data, many distributed algorithms have been developed to jointly study multiple datasets by communicating only summary-level statistics [17][18][19][20] . Among them, Duan et al [19][20] proposed a privacy-preserving Oneshot Distributed Algorithm for Logistic regression (ODAL), which can be used to identify risk factors of a binary healthcare outcome of interest. Compared to existing methods, ODAL requires only one round of communication across sites and can achieve high accuracy as a pooled analysis in which a logistic regression is fitted on the combined dataset 20 .…”
Section: Introductionmentioning
confidence: 99%