2017
DOI: 10.1145/3039902.3039910
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient GPUAccelerated Implementation of Genomic Short Read Mapping with BWAMEM

Abstract: Next Generation Sequencing techniques have resulted in an exponential growth in the generation of genetics data, the amount of which will soon rival, if not overtake, other Big Data fields, such as astronomy and streaming video services. To become useful, this data requires processing by a complex pipeline of algorithms, taking multiple days even on large clusters. The mapping stage of such genomics pipelines, which maps the short reads onto a reference genome, takes up a significant portion of execution time.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…This is because ACC greatly reduces the computational bottleneck, which increases the relative effect of the storage subsystem on the end-to-end execution time. The ACC and Ideal-ISF+ACC results clearly show that data movement between the storage devices and the hardware accelerator, which has not been properly considered in prior read mapping accelerators [39,40,43,61,62,65,[70][71][72][73][74][75][76][77], can significantly bottleneck the potential benefits of the accelerator. Comparison to Other Near-Data Processing Systems.…”
Section: Results and Analysismentioning
confidence: 99%
“…This is because ACC greatly reduces the computational bottleneck, which increases the relative effect of the storage subsystem on the end-to-end execution time. The ACC and Ideal-ISF+ACC results clearly show that data movement between the storage devices and the hardware accelerator, which has not been properly considered in prior read mapping accelerators [39,40,43,61,62,65,[70][71][72][73][74][75][76][77], can significantly bottleneck the potential benefits of the accelerator. Comparison to Other Near-Data Processing Systems.…”
Section: Results and Analysismentioning
confidence: 99%
“…As high-quality algorithms such as BWA-MEM [37] became a de facto standard, the usability of the GPU-aware alignment softwares were limited. Some approaches [13], [29]- [31] tackle this problem and design a seed extension kernel general enough to be used for BWA-MEM using intra-query performance. However, later approaches based on inter-query parallelism outperformed these kernels, which is the strategy adopted by the current state-of-the-art methods such as NVBIO [3] or GASAL2 [9].…”
Section: B Gpu-accelerated Sequence Alignment Softwaresmentioning
confidence: 99%
“…To our knowledge the only application-level accelerated integrated implementations of BWA-MEM that exist are: an FPGA-accelerated implementation of the Seed Extension phase [15] achieving a 1.5x speedup, further improved in [16] for an overall 2.6x speedup; and a GPU implementation [9], further improved to achieve an up to 2x speedup [17]. The FPGA implementation used here builds on [15], and a comparison of the implementation here is made to the improved GPU implementation.…”
Section: Related Workmentioning
confidence: 99%
“…Then, the details of the FPGAaccelerated implementation on the Alpha Data card are given. Finally, details of the GPU implementation are briefly discussed (further details can be found in [17]). …”
Section: Architecture Design and Implementationmentioning
confidence: 99%
See 1 more Smart Citation