Oswald

Rucci, Enzo; Garcı́a, Carlos; Botella, Guillermo; Giusti, Armando Eduardo De; Naiouf, Marcelo; Prieto-Matías, Manuel

doi:10.1177/1094342016654215

Cited by 31 publications

(13 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…2). It is important to mention that, although in the OSWALD implementation [10] Intel/Altera OpenCL channels are used to communicate these data, the use of this technique is not feasible in the context of DNA with millions of nucleotide bases involved, since its size would exceed by far the channel resources available. We should point out that although the use of these buffers could double memory consumption, it is by far compensated on speedup terms.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences

et al. 2018

Self Cite

View full text Add to dashboard Cite

BackgroundThe Smith-Waterman (SW) algorithm is the best choice for searching similar regions between two DNA or protein sequences. However, it may become impracticable in some contexts due to its high computational demands. Consequently, the computer science community has focused on the use of modern parallel architectures such as Graphics Processing Units (GPUs), Xeon Phi accelerators and Field Programmable Gate Arrays (FGPAs) to speed up large-scale workloads.ResultsThis paper presents and evaluates SWIFOLD: a Smith-Waterman parallel Implementation on FPGA with OpenCL for Long DNA sequences. First, we evaluate its performance and resource usage for different kernel configurations. Next, we carry out a performance comparison between our tool and other state-of-the-art implementations considering three different datasets. SWIFOLD offers the best average performance for small and medium test sets, achieving a performance that is independent of input size and sequence similarity. In addition, SWIFOLD provides competitive performance rates in comparison with GPU-based implementations on the latest GPU generation for the large dataset.ConclusionsThe results suggest that SWIFOLD can be a serious contender for accelerating the SW alignment of DNA sequences of unrestricted size in an affordable way reaching on average 125 GCUPS and almost a peak of 270 GCUPS.

show abstract

Section: Methodsmentioning

confidence: 99%

“…Most of them correspond to protein alignment, and are parallelized on High-Performance Computing (HPC) architectures [7] and emerging architectures [8–10]. For very long sequences, such as with DNA, the number of works is significantly lower.…”

Section: Introductionmentioning

confidence: 99%

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences

et al. 2018

Self Cite

View full text Add to dashboard Cite

show abstract

“…The first three servers were used to evaluate SWIMM 2.0 and the other SIMD-based alternatives, while the rest were employed to perform a comparison with GPUs and FPGAs. The performance was evaluated by carrying out similar experiments to those in previous works [19,14,11,7]. We have evaluated SWIMM 2.0 by searching 20 query protein sequences against three well-known databases of different size:…”

Section: Experimental Designmentioning

confidence: 99%

“…This version computes using 32-bit integer data but allow us to include newer GPUs in the analysis. For FPGA accelerators, we have chosen the OS-WALD package [19] in its hybrid configuration because it offers a satisfactory performance-power tradeoff. Table 2 presents power efficiency ratios considering the GCUPS peak performance and the Thermal Design Power (TDP) of each platform.…”

Section: Performance and Power Efficiency Comparison With Gpus And Fpgasmentioning

confidence: 99%

“…Moreover, the recently released SWhybrid framework offers the possibility of combining CPUs, GPUs and Xeon Phi's [15]. Finally, there are also studies that use Field Programmable Gate Arrays (FPGAs) as accelerators, such as linear systolic array implementations for Xilinx Virtex FPGAs [16,17]; custom instructions [18] and the novel OpenCL paradigm on Intel-Altera's FPGAs [19].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

SWIMM 2.0: Enhanced Smith–Waterman on Intel’s Multicore and Manycore Architectures Based on AVX-512 Vector Extensions

Rucci

Sánchez

Juan

et al. 2018

Int J Parallel Prog

Self Cite

View full text Add to dashboard Cite

The well-known Smith-Waterman (SW) algorithm is the most commonly used method for local sequence alignments, but its acceptance is limited by the computational requirements for large protein databases. Although the acceleration of SW has already been studied on many parallel platforms, there are hardly any studies which take advantage of the latest Intel architectures based on AVX-512 vector extensions. This SIMD set is currently supported by Intel's Knights Landing (KNL) accelerator and Intel's Skylake (SKL) general purpose processors. In this paper, we present an SW version that is optimized for both architectures: the renowned SWIMM 2.0. The novelty of this vector instruction set requires the revision of previous programming and optimization techniques. SWIMM 2.0 is based on a massive multi-threading and SIMD exploitation. It is competitive in terms of performance compared with other state-of-the-art implementations, reaching 511 GCUPS on a single KNL node and 734 GCUPS on a server equipped with a dual SKL processor. Moreover, these successful performance rates make SWIMM 2.0 the most efficient energy footprint implementation in this study achieving 2.94 GCUPS/Watts on the SKL processor.

show abstract

A Block-Based Systolic Array on an HBM2 FPGA for DNA Sequence Alignment

Abdelhamid

Yamaguchi

2020

Applied Reconfigurable Computing. Architectures, Tools, and Applications

View full text Add to dashboard Cite

Oswald

Cited by 31 publications

References 23 publications

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences

SWIMM 2.0: Enhanced Smith–Waterman on Intel’s Multicore and Manycore Architectures Based on AVX-512 Vector Extensions

A Block-Based Systolic Array on an HBM2 FPGA for DNA Sequence Alignment

Contact Info

Product

Resources

About