2010
DOI: 10.1109/tc.2009.180
|View full text |Cite
|
Sign up to set email alerts
|

Architectures and Execution Models for Hardware/Software Compilation and Their System-Level Realization

Abstract: Abstract-We propose an execution model that orchestrates the fine-grained interaction of a conventional general-purpose processor (GPP) and a high-speed reconfigurable hardware accelerator (HA), the latter having full master-mode access to memory. We then describe how the resulting requirements can actually be realized efficiently in a custom computer by hardware architecture and system software measures. One of these is a low-latency HA-to-GPP signaling scheme with latency up to 23x times shorter than convent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2010
2010
2014
2014

Publication Types

Select...
6
3

Relationship

3
6

Authors

Journals

citations
Cited by 25 publications
(15 citation statements)
references
References 15 publications
0
15
0
Order By: Relevance
“…We modified the Reference Design by inserting the TechMod into the MCI bus between PowerPC and external 200 MHz DDR2-SDRAM main memory. The GPP can thus access the HA in a simple memory mapped fashion, while the 100 MHz HA can directly access main memory, following our FastLane architecture [16] of giving the GPP priority over the HA to ensure stable system operation. The system was implemented using Xilinx EDK and ISE 10.3, and Synplify Premier DP 9.6.1.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We modified the Reference Design by inserting the TechMod into the MCI bus between PowerPC and external 200 MHz DDR2-SDRAM main memory. The GPP can thus access the HA in a simple memory mapped fashion, while the 100 MHz HA can directly access main memory, following our FastLane architecture [16] of giving the GPP priority over the HA to ensure stable system operation. The system was implemented using Xilinx EDK and ISE 10.3, and Synplify Premier DP 9.6.1.…”
Section: Methodsmentioning
confidence: 99%
“…While explored mostly for GPPs [13], [17], [21], speculative execution is not limited to that domain [3] and can also be used in the parallel paradigms used for HAs [16].…”
Section: A Speculation In Temporal and Spatial Compute Modelsmentioning
confidence: 99%
“…As target platform, we employ a Xilinx ML507 board (Virtex-5 FX-based), using the hardware and software environment described in [16] to achieve high-throughput low-latency access to shared memory between the accelerator(s) and the general-purpose PowerPC 440 processor. As the XC5VFX70T device on the actual board is too small to hold the complete system-on-chip (processor buses, memory controller, network interface, etc.)…”
Section: B Impact On High-level Synthesismentioning
confidence: 99%
“…At this level, the loop has been encapsulated as a single operation. When it detects the loop termination condition, it signals the end of hardware execution to the hardware/software interface layer [16] and passes back the computed factorial from hardware to software. Since we compile for the ACS target to a fully spatial hardware implementation with no operator reuse, we can employ a variant of the classical As-Soon-AsPossible (ASAP) static scheduling algorithm [21], adding just minor extensions to obey explicit constraints (discussed in Section 4.4).…”
Section: Hardware Synthesis In Nymblementioning
confidence: 99%