Proceedings of the 8th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies 2017
DOI: 10.1145/3120895.3120909
|View full text |Cite
|
Sign up to set email alerts
|

FPGA-based Stream Computing for High-Performance N-Body Simulation using Floating-Point DSP Blocks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…As the porting of the OmpSs@FPGA framework to the Alveo boards is still an ongoing work, this number is expected to grow as a more mature implementation that allows a more thorough testing is developed. This value of 57.67 bapps can be compared with 11 bapps obtained by Sano et al [ 9 ] for force calculations with using cut-off distances (although we are talking about different technologies, Alveo U200 vs. Arria 10 with more than 5 years difference). A more direct comparison can be made on the histogram construction by A. Leonardi and D.L.…”
Section: Resultsmentioning
confidence: 56%
See 1 more Smart Citation
“…As the porting of the OmpSs@FPGA framework to the Alveo boards is still an ongoing work, this number is expected to grow as a more mature implementation that allows a more thorough testing is developed. This value of 57.67 bapps can be compared with 11 bapps obtained by Sano et al [ 9 ] for force calculations with using cut-off distances (although we are talking about different technologies, Alveo U200 vs. Arria 10 with more than 5 years difference). A more direct comparison can be made on the histogram construction by A. Leonardi and D.L.…”
Section: Resultsmentioning
confidence: 56%
“…This demonstrates that current FPGA boards can provide good results for these kinds of O(N 2 ) problems. In this context, Sano et al [ 9 ] obtained the best results, for a single FPGA, on the Arria 10, with a score of 10.94 billion force calculations per second for 262,144 particles, at 180 MHz kernel frequency. A direct comparison is not possible because Sano et al [ 9 ] not only computed the distance between all the pairs, but also this information was further used for the calculation of the total force experienced by each body.…”
Section: Introductionmentioning
confidence: 99%
“…There have been other efforts to implement the N-body on FPGAs and even ASICS. In [10], Sano et al implement a full custom FPGA design that solves the N-body problem on a single Intel Arria10 FPGA, achieving 10.944 Gpairs/s. In [6], de Haro et al uses OmpSs@FPGA to execute the N-body on a Xilinx Alveo U200 board reaching 37.62 Gpairs/s with a performance per watt of 0.58.…”
Section: Related Workmentioning
confidence: 99%
“…However, they only propose a design for the force calculation, and do not take into account more than one simulation step, which requires to update the positions and velocities. In [12] Sano et. al.…”
Section: N-bodymentioning
confidence: 99%