2017
DOI: 10.1038/nbt.3772
|View full text |Cite
|
Sign up to set email alerts
|

Toil enables reproducible, open source, big biomedical data analyses

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
811
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 1,007 publications
(877 citation statements)
references
References 14 publications
1
811
0
1
Order By: Relevance
“…It is essential that experiments are analyzed in a reproducible manner. Computational workflow systems allow for automation of analysis pipelines, that scale from personal computers to HPC and cloud environments (Koster and Rahmann 2012;Di Tommaso et al 2017;Vivian et al 2017). While standardized solutions exist for installation of software and analysis tools ("Pip," n.d.; "Bioconda," n.d.), data download often has to either be performed manually or be scripted on a caseby-case basis.…”
Section: Discussionmentioning
confidence: 99%
“…It is essential that experiments are analyzed in a reproducible manner. Computational workflow systems allow for automation of analysis pipelines, that scale from personal computers to HPC and cloud environments (Koster and Rahmann 2012;Di Tommaso et al 2017;Vivian et al 2017). While standardized solutions exist for installation of software and analysis tools ("Pip," n.d.; "Bioconda," n.d.), data download often has to either be performed manually or be scripted on a caseby-case basis.…”
Section: Discussionmentioning
confidence: 99%
“…Both the pipeline and associated documentation can be found at https://github.com/ComparativeGenomicsToolkit/ Comparative-Annotation-Toolkit. CAT is constructed using the Luigi (https:// github.com/spotify/luigi) workflow manager, with Toil [154] used for computationally intensive steps that work best when submitted to a compute cluster.…”
Section: Methodsmentioning
confidence: 99%
“…Both the Dockerfile and the resulting container can be moved between machines without installing additional software, and the container can be rerun later with the exact same libraries. Containers can be orchestrated for parallel execution using, for example, Kubernetes (https://kubernetes.io/) or Docker Swarm (https://github.com/docker/swarm), and there are now multiple pipelining tools that use Docker or provide Docker container support including Nextflow [26], Toil [22], Pachyderm (http://www.pachyderm.io/), Luigi (https:// github.com/spotify/luigi) [27], Rabix/bunny [28], and our own walrus system (http://github.com/fjukstad/walrus).…”
Section: Current Trendsmentioning
confidence: 99%
“…Toil is a workflow software to run scientific workflows on a large scale in cloud or high-performance computing (HPC) environments [22]. It is designed for large-scale analysis pipelines such as The Cancer Genome Atlas (TCGA) [23] best practices pipeline for calculating gene-and isoformlevel expression values from RNA-seq data.…”
Section: Toil: Tcga Rna-seq Reference Pipelinementioning
confidence: 99%
See 1 more Smart Citation