2021
DOI: 10.1016/j.cpc.2020.107513
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid MPI-CUDA approach for nonequispaced discrete Fourier transformation

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 41 publications
0
4
0
Order By: Relevance
“…Therefore, we proposed a hybrid parallel scheme combining multiple CPU and GPU devices to upgrade the CU-ENUF method, which is described as HP-ENUF method. [44,45] Similar to the CU-ENUF method, [43]…”
Section: Architecture Of the Hp-enuf Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, we proposed a hybrid parallel scheme combining multiple CPU and GPU devices to upgrade the CU-ENUF method, which is described as HP-ENUF method. [44,45] Similar to the CU-ENUF method, [43]…”
Section: Architecture Of the Hp-enuf Methodsmentioning
confidence: 99%
“…[8,40,41] In addition, several (hybrid) parallelization strategies based on gridding [42] and Near-Distance algorithms [43] have been developed to accelerate the evaluation of electrostatic energies and forces using GPU and CUDA technology. [44,45] These derivatives of the ENUF and ENUF-DPD methods exhibit distinct computational efficiencies in handling long range electrostatic interactions between charged particles and charge density distributions at multiple spatiotemporal scales.…”
Section: Introductionmentioning
confidence: 99%
“…An other research group [2] also proposes GPU sparse FFT algorithm based on parallel optimization and the authors mention that this algorithm "leads enormous speedups". Moreover, in a very recent research [23], authors propose an hybrid MPI -CUDA implementation for nonequispaced discrete Fourier transformation using parallel threads launched from CPU nodes for managing the thread-level parallelism in multiple GPU devices. The authors prove that using hybrid parallelization, an increased improvement in computational efficiency is obtained without losing the computational precision.…”
Section: Cuda For Fourier Transformmentioning
confidence: 99%
“…The authors prove that using hybrid parallelization, an increased improvement in computational efficiency is obtained without losing the computational precision. Also, their method can balance in a dynamic way the connection between performance and throughput capacity by modifying the number of computer nodes used for parallel computations [23].…”
Section: Cuda For Fourier Transformmentioning
confidence: 99%