2020 International Workshop on Rapid System Prototyping (RSP) 2020
DOI: 10.1109/rsp51120.2020.9244859
|View full text |Cite
|
Sign up to set email alerts
|

A combined fast/cycle accurate simulation tool for reconfigurable accelerator evaluation: application to distributed data management

Abstract: Parallel computing systems based on reconfigurable accelerators are becoming (1) increasingly heterogeneous, (2) difficult to design and (3) complex to model. Such modeling of a parallel computing system helps to evaluate its performance and to improve its architecture before prototyping. This paper presents a simulation tool aiming to study the integration of reconfigurable accelerators in scalable distributed systems and runtimes, such as S-DSM systems, where S-DSM (softwaredistributed shared memory) is a pa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 23 publications
(18 reference statements)
0
2
0
Order By: Relevance
“…We can adapt the monitoring and strategically place the set of probes depending on the architecture parts to be evaluated. For example, Lenormand et al [7] propose an FPGA implementation integrating a monitor to evaluate and optimize a matrix multiplication accelerator. The monitor allows them to measure the performance of an accelerator, to find its optimal settings and to identify its bottlenecks.…”
Section: Related Workmentioning
confidence: 99%
“…We can adapt the monitoring and strategically place the set of probes depending on the architecture parts to be evaluated. For example, Lenormand et al [7] propose an FPGA implementation integrating a monitor to evaluate and optimize a matrix multiplication accelerator. The monitor allows them to measure the performance of an accelerator, to find its optimal settings and to identify its bottlenecks.…”
Section: Related Workmentioning
confidence: 99%
“…However, moving data from a CPU host and an accelerator (GPGPU, FPGA) is still the responsibility of the user, as it is commonly done for MPI/OpenMP or MPI/CUDA in HPC systems. In this work [14], a system is proposed to transparently manage these interactions between hosts and FPGAs based on this S-DSM, providing a full DSM implementation among the processing elements, however this is not in the scope of this report.…”
Section: S-dsm Programming Modelmentioning
confidence: 99%