2017
DOI: 10.12688/f1000research.10137.1
|View full text |Cite
|
Sign up to set email alerts
|

The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows

Abstract: As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S. Precision Medicine Initiative, and the NIH Big Data to Knowledge Center for Translational Genomics, are using cloud-based infrastructure to both host and perform analysis across large data sets. In … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
73
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 91 publications
(73 citation statements)
references
References 7 publications
0
73
0
Order By: Relevance
“…Incompatible versions of interacting tools would disrupt the workflow as a whole. This motivated Dockstore to shift from dynamic image creation with Ansible to ready-prepared Docker images [30]. The exact specification of versions from snapshot.debian.org, Bioconda, or unspecified versions of two tools together in the same release of a distribution is expected to overcome this difficulty.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Incompatible versions of interacting tools would disrupt the workflow as a whole. This motivated Dockstore to shift from dynamic image creation with Ansible to ready-prepared Docker images [30]. The exact specification of versions from snapshot.debian.org, Bioconda, or unspecified versions of two tools together in the same release of a distribution is expected to overcome this difficulty.…”
Section: Methodsmentioning
confidence: 99%
“…To the rescue come registries of software like Bio.Tools [17], and the resource identification initiative (https://scicrunch.org/resources) expands the same concept well beyond software. For sharing whole workflows, myExperiment (http://myexperiment.org/) is a well-established repository used by multiple workflow systems and research communities [10], complementing more specific workflow repositories like CWL Viewer [33] (https://view.commonwl.org/), Galaxy's ToolShed (https:// galaxyproject.org/toolshed/workflow-sharing/), and Dockstore [30].…”
Section: Shipping Confidence: a Workflow Perspectivementioning
confidence: 99%
“…As the usage of workflow management tools spreads, an increasing number of tertiary tools are tying into the ecosystem. The nf-core analysis pipelines are at the forefront of this, collaborating with initiatives such as bio.tools 28 , the GA4GH-compliant Dockstore 29 , and with plans to work together with the Biocontainers 18 project in the future to further simplify software packaging.…”
Section: Explicit Support For Running In Cloud Environmentsmentioning
confidence: 99%
“…Container registries provide an authoritative mapping from a container name (e.g., https://quay.io/repository/biocontainers/hisat2?tag=2.1.0-py27pl5.22.0_0&tab=tags) to a specific container image. We register containers with Quay (https://quay.io), and since we build standard open container initiative (OCI) containers (https://www.opencontainers.org), other registries (e.g., DockerHub, Cook, 2017; BioContainers, da Veiga Leprevost et al, 2017; or Dockstore, O’Connor et al, 2017) can also be used. Finally, in addition to creating containers for single Bioconda packages, it is possible to automatically create containers for combinations of packages.…”
Section: A Technology Stack For Reproducibilitymentioning
confidence: 99%