2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) 2021
DOI: 10.1109/ipdpsw52791.2021.00105
|View full text |Cite
|
Sign up to set email alerts
|

cuFINUFFT: a load-balanced GPU library for general-purpose nonuniform FFTs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 24 publications
0
3
0
Order By: Relevance
“…It is worth noting that the authors of FINUFFT [27] are working on a GPU implementation of the API which, at the time of writing, is still incomplete. According to their benchmarks [50], their implementation is much faster than gpuNUFFT [51] that we used in this paper. When cuFINUFFT is completed, we will integrate it in our implementation in order to produce the same output on both CPU and GPU, thus making the evaluation even fairer and, additionally, improving the performance.…”
Section: Discussionmentioning
confidence: 99%
“…It is worth noting that the authors of FINUFFT [27] are working on a GPU implementation of the API which, at the time of writing, is still incomplete. According to their benchmarks [50], their implementation is much faster than gpuNUFFT [51] that we used in this paper. When cuFINUFFT is completed, we will integrate it in our implementation in order to produce the same output on both CPU and GPU, thus making the evaluation even fairer and, additionally, improving the performance.…”
Section: Discussionmentioning
confidence: 99%
“…This can be done by computing D non-uniform Fourier transforms (see Wang et al (2022); Gossard et al (2022); Wang and Fessler (2021)). Different packages were tested and we finally opted for the cuFINUFFT implementation Shih et al (2021). The bindings for different kind of NUFT are available at https://github.com/albangossard/Bindings-NUFFT-pytorch/.…”
Section: Ethical Standardsmentioning
confidence: 99%
“…Therefore, the total latency can be limited below 500 ms (66 ms+ 250 ms+150 ms=466 ms). Moreover, algorithm optimization and dedicated hardware to improve the efficiency of NUFFT and DVF computations can further reduce the latency (Knoll et al 2014, Shih et al 2021.…”
Section: Prospect Of Real-time Mr Imagingmentioning
confidence: 99%