2021
DOI: 10.1371/journal.pcbi.1008880
|View full text |Cite
|
Sign up to set email alerts
|

Orchestrating privacy-protected big data analyses of data from different resources with R and DataSHIELD

Abstract: Combined analysis of multiple, large datasets is a common objective in the health- and biosciences. Existing methods tend to require researchers to physically bring data together in one place or follow an analysis plan and share results. Developed over the last 10 years, the DataSHIELD platform is a collection of R packages that reduce the challenges of these methods. These include ethico-legal constraints which limit researchers’ ability to physically bring data together and the analytical inflexibility assoc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 33 publications
(34 citation statements)
references
References 23 publications
0
34
0
Order By: Relevance
“…In addition to dsMTL, other packages in the DataSHIELD ecosystem exist for e.g. "big data" storage and management 19 , various statistical tests 7,19 and deep learning 19,20 .…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In addition to dsMTL, other packages in the DataSHIELD ecosystem exist for e.g. "big data" storage and management 19 , various statistical tests 7,19 and deep learning 19,20 .…”
Section: Discussionmentioning
confidence: 99%
“…dsMTL was designed to support a wide variety of data types. For this, an architecture package resourcer 19 developed by the DataSHIELD community was incorporated to facilitate the efficient import and export of large-scale datasets in compressed formats.…”
Section: Efficiencymentioning
confidence: 99%
See 1 more Smart Citation
“…KP-INTIMIC is a knowledge hub comprising 55 partners from 9 countries with the aim of fostering studies on the microbiome, nutrition, and health, making them findable, accessible, interoperable, and reusable (FAIR) to the scientific community to reduce fragmentation. The consortium also aims to (1) standardize and harmonize data for comparability, (2) move from association to causality, and (3) facilitate data sharing [ 25 , 26 ]. The consortium was assembled in response to a call by the Joint Programming Initiative “A Healthy Diet for a Healthy Life” (JPI-HDHL) ERA-Net Cofund “Interrelation of the INtesTInal MICrobiome, Diet and Health” (HDHL-INTIMIC).…”
Section: Methodsmentioning
confidence: 99%
“…To this end, our recent development of the “resources” architecture in DataSHIELD will facilitate handling complex big data, including omics, within DataSHIELD through the Opal data warehouse. 85 …”
Section: Project Descriptionmentioning
confidence: 99%