2012
DOI: 10.1002/cpe.2927
|View full text |Cite
|
Sign up to set email alerts
|

Adapting bioinformatics applications for heterogeneous systems: a case study

Abstract: The advent of new sequencing technologies has generated extremely large amounts of information. To successfully apply bioinformatics tools to such large datasets, they need to exhibit scalability and ideally elasticity in diverse computing environments. We describe the application of Weaver to the PEMer structural variation detection workflow. Because the original workflow has an intractable sequential running time on large datasets, it also has a batch implementation designed for a shared file system. Using s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
4
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…Lanc et al . describes the adaptation and parallelization process of Paired‐End Mapper structural variation pipeline and the Burrows–Wheeler alignment tool for execution on clusters, grids, and clouds using the weaver/starch/makeflow workflow stack. Authors describe the application of previous obtained lessons to a new workflow with and without shared file storage to tract the intractable sequential running times of these applications on large datasets.…”
Section: Special Issue Papersmentioning
confidence: 99%
“…Lanc et al . describes the adaptation and parallelization process of Paired‐End Mapper structural variation pipeline and the Burrows–Wheeler alignment tool for execution on clusters, grids, and clouds using the weaver/starch/makeflow workflow stack. Authors describe the application of previous obtained lessons to a new workflow with and without shared file storage to tract the intractable sequential running times of these applications on large datasets.…”
Section: Special Issue Papersmentioning
confidence: 99%
“…Many of these annotation tasks have natural inherent parallelism. We previously demonstrated that a distributed computing framework can allow developers to leverage a variety of heterogeneous computational resources in a lightweight and relatively easy to program manner (Lanc et al, 2011;Moretti et al, 2009;Bui et al, 2011;Thrasher et al, 2010). There are a variety of similar existing frameworks.…”
Section: Introductionmentioning
confidence: 99%
“…All analysis is performed using our general and scalable framework that includes, among other tools, the discovery of SNPs [1], natural population haplotyping [2], and largescale annotation of diverse environmental samples [3,4]. We show runtime results of this framework using Amazon EC2, SGE, Condor and mixtures of the three.…”
mentioning
confidence: 99%