2020
DOI: 10.1101/2020.03.23.003897
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accelerating Maximal-Exact-Match Seeding with Enumerated Radix Trees

Abstract: Motivation: Read alignment is a time-consuming step in genome sequence analysis. In the read alignment software BWA-MEM and the recently published faster version BWA-MEM2, the seeding step is a major bottleneck, for instance, contributing 38% to the overall execution time in BWA-MEM2 when aligning single-end whole human genome reads from the Platinum Genomes dataset. This is because both BWA-MEM and BWA-MEM2 use a compressed index structure called the FMD-Index, which results in high memory bandwidth requireme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…Sequence-to-Sequence Accelerators. Even though there are several hardware accelerators designed to alleviate bottlenecks in several steps of traditional sequence-to-sequence (S2S) mapping (e.g., pre-alignment filtering [72,73,75,76,94,[140][141][142][143][144][145][146][147][148], sequenceto-sequence alignment [68-70, 129-132, 149-151]), none of these designs can be directly employed for the sequence-to-graph (S2G) mapping problem. This is because S2S mapping is a special case of S2G mapping, where all nodes have only one edge (Figure 3a).…”
Section: Accelerating Sequence-to-graph Mappingmentioning
confidence: 99%
See 1 more Smart Citation
“…Sequence-to-Sequence Accelerators. Even though there are several hardware accelerators designed to alleviate bottlenecks in several steps of traditional sequence-to-sequence (S2S) mapping (e.g., pre-alignment filtering [72,73,75,76,94,[140][141][142][143][144][145][146][147][148], sequenceto-sequence alignment [68-70, 129-132, 149-151]), none of these designs can be directly employed for the sequence-to-graph (S2G) mapping problem. This is because S2S mapping is a special case of S2G mapping, where all nodes have only one edge (Figure 3a).…”
Section: Accelerating Sequence-to-graph Mappingmentioning
confidence: 99%
“…Existing hardware accelerators for genome sequence analysis focus on accelerating only the traditional sequence-to-sequence mapping pipeline, and cannot support genome graphs as their inputs. For example, GenStore [142], ERT [144], GenCache [143], NEST [145], MEDAL [146], SaVI [147], SMEM++ [148], Shifted Hamming Distance [94], GateKeeper [72], MAGNET [140], Shouji [141], and SneakySnake [73,76] accelerate the seeding and/or filtering steps of sequence-to-sequence mapping.…”
Section: Related Workmentioning
confidence: 99%