2017
DOI: 10.18637/jss.v077.i13
|View full text |Cite
|
Sign up to set email alerts
|

Software for Distributed Computation on Medical Databases: A Demonstration Project

Abstract: Bringing together the information latent in distributed medical databases promises to personalize medical care by enabling reliable, stable modeling of outcomes with rich feature sets (including patient characteristics and treatments received). However, there are barriers to aggregation of medical data, due to lack of standardization of ontologies, privacy concerns, proprietary attitudes toward data, and a reluctance to give up control over end use. Aggregation of data is not always necessary for model fitting… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 16 publications
0
9
0
Order By: Relevance
“…For GWAS, our pipeline leverages an algorithm for performing PCA by communicating the LD-matrices (Algorithm 1), and subsequently a method for performing generalized regression (most commonly, linear and logistic regression) by iteratively solving a regularized regression at each silo. This general, iterative approach, known as Alternating Directions Method of Multipliers (11; 12), is guaranteed to converge (13), has suc-2 cessfully been applied to many problems (14; 15; 16), and for GWAS we show that moderately high accuracies can be achieved with a few iterations (also see (14)). We show that our pipeline is accurate, scalable, practical, and a significant improvement over the meta 1.0 approach.…”
Section: Introductionmentioning
confidence: 90%
“…For GWAS, our pipeline leverages an algorithm for performing PCA by communicating the LD-matrices (Algorithm 1), and subsequently a method for performing generalized regression (most commonly, linear and logistic regression) by iteratively solving a regularized regression at each silo. This general, iterative approach, known as Alternating Directions Method of Multipliers (11; 12), is guaranteed to converge (13), has suc-2 cessfully been applied to many problems (14; 15; 16), and for GWAS we show that moderately high accuracies can be achieved with a few iterations (also see (14)). We show that our pipeline is accurate, scalable, practical, and a significant improvement over the meta 1.0 approach.…”
Section: Introductionmentioning
confidence: 90%
“…Both obstacles can be overcome by turning to distributed computations, which consists in leaving the data on sites and distributing the calculations, so that hospitals only share some intermediate results instead of the raw data (Narasimhan et al, 2017). Among other methods, SVD, which only involves inner products and sums, can be very straightforwardly implemented in a distributed manner.…”
Section: Imputation Of Multilevel Mixed Datamentioning
confidence: 99%
“…Indeed, all the computations in Algorithm 4 can be done in parallel with a master-slave architecture (Narasimhan et al, 2017), where a central server collects summary statistics computed locally on sites, as illustrated Figure 5. Here, the local right singular vectors v j , j ∈ {1, .…”
Section: Distributed Rank-q Pcamentioning
confidence: 99%
“…Brown et al 2010a;Brown et al 2010b) and open source software (e.g. Carter et al 2016;Narasimhan et al 2017). The Canadian Network for Observational Drug Effect Studies (CNODES, Suissa et al 2012) and Mini-Sentinel (a safety surveillance system developed by the U.S. Food and Drugs Administration, Platt and Carnahan, 2012) are both platforms to facilitate the running of analysis requests from approved users locally, along with disclosure checks, prior to securely combining the results centrally as a meta-analysis.…”
Section: Alternative Approachesmentioning
confidence: 99%