2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines 2011
DOI: 10.1109/fccm.2011.29
|View full text |Cite
|
Sign up to set email alerts
|

Multilevel Granularity Parallelism Synthesis on FPGAs

Abstract: Abstract-Recent progress in High-Level Synthesis (HLS) techniques has helped raise the abstraction level of FPGA programming. However implementation and performance evaluation of the HLS-generated RTL, involves lengthy logic synthesis and physical design flows. Moreover, mapping of different levels of coarse grained parallelism onto hardware spatial parallelism affects the final FPGA-based performance both in terms of cycles and frequency. Evaluation of the rich design space through the full implementation flo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
5
4
1

Relationship

3
7

Authors

Journals

citations
Cited by 44 publications
(12 citation statements)
references
References 14 publications
0
12
0
Order By: Relevance
“…FCUDA [3] is a source-to-source compiler from NVidia's CUDA language to synthesizable AutoPilot C. Building on FCUDA, a highlevel parameter space exploration tool is presented in [6]. The tool first builds a resource and clock-period estimation models by actually synthesizing a small number of sample configurations.…”
Section: Related Workmentioning
confidence: 99%
“…FCUDA [3] is a source-to-source compiler from NVidia's CUDA language to synthesizable AutoPilot C. Building on FCUDA, a highlevel parameter space exploration tool is presented in [6]. The tool first builds a resource and clock-period estimation models by actually synthesizing a small number of sample configurations.…”
Section: Related Workmentioning
confidence: 99%
“…Thus an interesting study would try to find the best of both worlds: take massively parallel applications implemented in massively parallel programming languages and figure out how to map them to ASICs or FPGAs. FCUDA [40], [41], [71], [72], [73], [74] is one such study. The FCUDA compilation process is illustrated in Fig.…”
Section: Fcudamentioning
confidence: 99%
“…This work proposes efficient design space exploration techniques for applications that consist of multiple nested loops with or without dataflow dependence. In addition, instead of searching all possible configurations [2][3] [14][20], we prune the design space by eliminating the dominated configurations. Accurate performance and area models are also developed to assist the design space exploration to reduce number of invocation of the HLS tool.…”
Section: Related Workmentioning
confidence: 99%