2020
DOI: 10.1109/tcad.2020.2970597
|View full text |Cite
|
Sign up to set email alerts
|

FLASH: Fast, Parallel, and Accurate Simulator for HLS

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 19 publications
(11 citation statements)
references
References 26 publications
0
11
0
Order By: Relevance
“…Performance counters are inserted to the accelerator to collect the relevant metrics. We implement SPLAG using an open-source extension to HLS C++, TAPA [10], to leverage the convenient peeking interfaces, fast software simulation [6,12], asynchronous memory interfaces, simplified host-kernel interfaces, and coarse-grained floorplanning [26,27]. Our implementation targets the Alveo U280 board with 32 high-bandwidth memory (HBM) channels.…”
Section: Discussionmentioning
confidence: 99%
“…Performance counters are inserted to the accelerator to collect the relevant metrics. We implement SPLAG using an open-source extension to HLS C++, TAPA [10], to leverage the convenient peeking interfaces, fast software simulation [6,12], asynchronous memory interfaces, simplified host-kernel interfaces, and coarse-grained floorplanning [26,27]. Our implementation targets the Alveo U280 board with 32 high-bandwidth memory (HBM) channels.…”
Section: Discussionmentioning
confidence: 99%
“…The 200 MHz target frequency of other FPGA designs remain unchanged. [48,89] and by offloading simulation to an FPGA [64,65,97] or a GPU [91]. Our debugging tools are designed for both on-FPGA and simulation-based debugging.…”
Section: Efficiency Of Debugging Toolsmentioning
confidence: 99%
“…However, they may perform poorly due to the ine ciency of inter-thread communication and context switch handled by the operating system. e FLASH simulator [8,12] proposed an alternative to the above, which relies on the HLS scheduling information to mimic the RTL FSM. While this simulation approach itself is faster than multi-thread simulators, generating simulation executable becomes slower due to the need of the HLS scheduler output for cycle-accuracy, which is not needed for correctness veri cation.…”
Section: So Ware Simulationmentioning
confidence: 99%