2024
DOI: 10.1158/0008-5472.can-23-2657
|View full text |Cite
|
Sign up to set email alerts
|

NCI Cancer Research Data Commons: Cloud-Based Analytic Resources

David Pot,
Zelia Worman,
Alexander Baumann
et al.

Abstract: The NCI’s Cloud Resources (CRs) are the analytical components of the Cancer Research Data Commons (CRDC) ecosystem. This review describes how the three CRs (Broad Institute FireCloud, Institute for Systems Biology Cancer Gateway in the Cloud, and Seven Bridges Cancer Genomics Cloud) provide access and availability to large, cloud-hosted, multi-modal cancer datasets, as well as offer tools and workspaces for performing data analysis where the data resides, without download or storage. In addition, users can upl… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…Instead of using the public cloud services directly, our approach used the infrastructure built on top of the generic capabilities provided by the cloud providers to support scientific analysis workflows. We implemented the processing pipeline described earlier as a portable workflow compatible with the CRDC Terra (available under FireCloud within CRDC; further referred to as Terra for the sake of brevity) and Seven Bridges Cancer Genomics Cloud (SB-CGC) cloud resources (CRDC CRs) 9 . CRDC CRs are analytical components of CRDC that aim to simplify access to cloud-ready tools and enable cloud-based analysis of the data available in CRDC.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Instead of using the public cloud services directly, our approach used the infrastructure built on top of the generic capabilities provided by the cloud providers to support scientific analysis workflows. We implemented the processing pipeline described earlier as a portable workflow compatible with the CRDC Terra (available under FireCloud within CRDC; further referred to as Terra for the sake of brevity) and Seven Bridges Cancer Genomics Cloud (SB-CGC) cloud resources (CRDC CRs) 9 . CRDC CRs are analytical components of CRDC that aim to simplify access to cloud-ready tools and enable cloud-based analysis of the data available in CRDC.…”
Section: Methodsmentioning
confidence: 99%
“…We present a detailed investigation of the development and optimization of a computational workflow to perform NLST segmentation efficiently - both in terms of processing time and costs - using the Google Cloud Platform and the components of the NCI Cancer Research Data Commons (CRDC) 9 - a cloud-based data science infrastructure that provides secure access to a large, comprehensive, and expanding collection of cancer research data.…”
Section: Introductionmentioning
confidence: 99%