Proceedings Design, Automation and Test in Europe Conference and Exhibition
DOI: 10.1109/date.2004.1268836
|View full text |Cite
|
Sign up to set email alerts
|

Loop shifting and compaction for the high-level synthesis of designs with complex control flow

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0
1

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(15 citation statements)
references
References 21 publications
0
14
0
1
Order By: Relevance
“…Directly connected to our work are those of Guo et al [2005], Weinhardt and Luk [2001] and Gupta et al [2004], where hardware is generated after optimizing the kernel loops.…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Directly connected to our work are those of Guo et al [2005], Weinhardt and Luk [2001] and Gupta et al [2004], where hardware is generated after optimizing the kernel loops.…”
Section: Background and Related Workmentioning
confidence: 99%
“…The work in Gupta et al [2004] is part of the SPARK project and uses shifting to expose loop parallelism and then to compact the loop by scheduling multiple operations to execute in parallel. In that case, loop shifting is performed at low level, whereas we perform it at a high functional level.…”
Section: Background and Related Workmentioning
confidence: 99%
“…High-level synthesis tools attempt to overcome this limitation by statically unrolling, flattening and pipelining loops in order to decrease the number of backwards branches that would be dynamically executed [14], but this can significantly increase the complexity of the centralized finite-state machines that implement the static schedule for the hardware datapath, resulting in very long combinational paths that can overwhelm any gains in IPC [15], [16]. Overcoming the performance limitations due to explicit control flow is the key issue that needs to be addressed for custom hardware to become performancecompetitive with conventional processors on sequential code.…”
Section: The Superscalar Performance Advantagementioning
confidence: 99%
“…However, implementing complex algorithms in FPGA-based systems can be a laborious work. This study is still realized by circuitvendor specific tools in many cases and requires deep design skills, so it remains the most time consuming operation in a design flow (Gupta et al, 2004;Paiz et al, 2008).…”
Section: Introductionmentioning
confidence: 99%