Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming 2011
DOI: 10.1145/1941553.1941589
|View full text |Cite
|
Sign up to set email alerts
|

Auto-tuning of fast fourier transform on graphics processors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 53 publications
(16 citation statements)
references
References 10 publications
0
16
0
Order By: Relevance
“…Previous works on fast Fourier transform for the GPU such as [17] includes transposition to improve locality for global memory accesses; the authors did not specify whether the transposition is in-place or not, but we believe it is an out-of-place one. We also believe that their work can be enhanced by employing an in-place transposition algorithm like ours to increase the maximum size of dataset allowed for GPU offloading.…”
Section: In-place and Out-of-place Transposition For Gpusmentioning
confidence: 98%
“…Previous works on fast Fourier transform for the GPU such as [17] includes transposition to improve locality for global memory accesses; the authors did not specify whether the transposition is in-place or not, but we believe it is an out-of-place one. We also believe that their work can be enhanced by employing an in-place transposition algorithm like ours to increase the maximum size of dataset allowed for GPU offloading.…”
Section: In-place and Out-of-place Transposition For Gpusmentioning
confidence: 98%
“…Hardware architectures can be dynamically adjusted to target execution speed and much more [18][19][20]. In High Performance Computing it is common to adapt a running application; e.g., tuning FFTs for graphics processing units [21]. Considerable effort has been devoted to MapReduce, which exposes many configurable parameters [22][23][24].…”
Section: Related Workmentioning
confidence: 99%
“…There has been a rich body of work on optimizations and tuning of GPGPU applications [24,34,68,76,84]. Ryoo et al [84] introduced two metrics to prune optimization space by calculating the utilization and efficiency of GPGPU applications.…”
Section: Other Gpgpu Performance Modeling Techniquesmentioning
confidence: 99%