2020
DOI: 10.1109/tcbb.2018.2884701
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating Sequence Alignments Based on FM-Index Using the Intel KNL Processor

Abstract: FM-index is a compact data structure suitable for fast matches of short reads to large reference genomes. The matching algorithm using this index exhibits irregular memory access patterns that cause frequent cache misses, resulting in a memory bound problem. This paper analyzes different FM-index versions presented in the literature, focusing on those computing aspects related to the data access. As a result of the analysis, we propose a new organization of FM-index that minimizes the demand for memory bandwid… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 36 publications
0
7
0
Order By: Relevance
“…However, KNL and KNM use out-of-order execution, high-bandwidth memory and interconnect. 10,53,73 Hence, they achieve high single core performance and overall throughput. Also, KNL/KNM support AVX-512 and all legacy ISAs such as x87/MMX/SSE/AVX/AVX2.…”
Section: Architectural Improvements Over Timementioning
confidence: 99%
See 2 more Smart Citations
“…However, KNL and KNM use out-of-order execution, high-bandwidth memory and interconnect. 10,53,73 Hence, they achieve high single core performance and overall throughput. Also, KNL/KNM support AVX-512 and all legacy ISAs such as x87/MMX/SSE/AVX/AVX2.…”
Section: Architectural Improvements Over Timementioning
confidence: 99%
“…On KNC, attaining high performance may require significant code-rewriting such as programming in intrinsics, 7,18,20,40,52,53,59,82,84,86,87 although the need of intrinsics is not as stringent on KNL due to the improvements in microarchitecture, compiler and ISA compared to KNC. 83 Use of intrinsics is error-prone and increases code-development time.…”
Section: Need Of Code-rewritingmentioning
confidence: 99%
See 1 more Smart Citation
“…Instead, simpler ARM based architectures along with high bandwidth memories can help in having an energy efficient architecture for NGS with better performance compared to existing solutions, leading to global energy savings for HPC data centers. NGS applications use full-text indexing strategies, such as the FM-index, based on Burrows-Wheeler transform (BWT) [18], [19] for fast sequence alignments. Additionally, Bowtie2 [20] is a new state-of-the-art NGS application based on FMindex, with efficient multi-threading capabilities.…”
Section: Introductionmentioning
confidence: 99%
“…Then, HISAT2 [22] is a graph based sequence alignment application and superior to both Bowtie2 and BWA-MEM in performance. HPC class compute resources like the Intel Xeon Phi KNL processors [23], Intel Skylake and Broadwell architectures, which support multiple threads, are used to run these NGS applications and maximize their performance [18], [24]. GPUs are also explored for genome sequencing as in [25].…”
Section: Introductionmentioning
confidence: 99%