2014 IEEE 25th International Conference on Application-Specific Systems, Architectures and Processors 2014
DOI: 10.1109/asap.2014.6868669
|View full text |Cite
|
Sign up to set email alerts
|

Understanding the design space of DRAM-optimized hardware FFT accelerators

Abstract: As technology scaling is reaching its limits, pointing to the well-known memory and power wall problems, achieving high-performance and energy-efficient systems is becoming a significant challenge. Especially for data-intensive computing, efficient utilization of the memory subsystem is the key to achieve high performance and energy efficiency. We address this challenge in DRAM-optimized hardware accelerators for 1D, 2D and 3D fast Fourier transforms (FFT) on large datasets. When the dataset has to be stored i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
4

Relationship

2
7

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 20 publications
0
8
0
Order By: Relevance
“…In the past there have been several implementations using local memory; however, with growing demand for larger image sizes external memory has to be used. There have been several DRAM remapping attempts before, such as [5,13]. They propose a tile-based approach where an × image (input array) is divided into / × / tiles where is the size of the DRAM row buffer which allows for very high-bandwidth DRAM access.…”
Section: Proposed: Tile-hopping Memory Mapping For 2d Fftsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the past there have been several implementations using local memory; however, with growing demand for larger image sizes external memory has to be used. There have been several DRAM remapping attempts before, such as [5,13]. They propose a tile-based approach where an × image (input array) is divided into / × / tiles where is the size of the DRAM row buffer which allows for very high-bandwidth DRAM access.…”
Section: Proposed: Tile-hopping Memory Mapping For 2d Fftsmentioning
confidence: 99%
“…However, in the case of 2D DFTs, 1D FFTs have to be computed in two dimensions, increasing the complexity to O( 2 log ), thereby making 2D DFTs a significant bottleneck for real-time machine vision applications [7]. Recently, there has been substantial effort to achieve high-performance implementations of multidimensional FFTs to overcome this constraint [5,[7][8][9][10][11][12][13][14]. Due to their inherent parallelism and reconfigurability, Field Programmable Gate Arrays (FPGAs) are attractive targets for accelerating FFT computations.…”
Section: Introductionmentioning
confidence: 99%
“…There are also several related work demonstrating 3D-stacked DRAM and processing elements integrated together for application acceleration [8], [26], [27], and for general purpose computing [6], [10], [5], [7]. Integrating a high-performance general purpose processor underneath the DRAM raises some thermal issues, on the other hand, energy-efficient accelerators are specific to a certain application.…”
Section: Related Workmentioning
confidence: 99%
“…The re-gridding and FFT units are implemented in the logic layer of a 3D-stacked DRAM similar to [15], [16]. The 2D-FFT requires double-buffered local memory that performs data permutations and a local FFT core that executes the FFT kernel [13], [16].…”
Section: D-ifft and System Integrationmentioning
confidence: 99%