2016
DOI: 10.1109/tcad.2016.2545408
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Hardware Design of Iterative Stencil Loops

Abstract: obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.The WestminsterResearch online digital archive at the University of Westminster aims to make the research output of the University available to a wider audience. Copyright and Moral Rights remain with the authors and/o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…In this section, we provide an example that illustrates why the DCMI acceleration strategy is superior to current state-of-the-art FPGA-based accelerators [4,40]. We first explain the operation of the stencil compute kernel in Section 2.1.…”
Section: Explaining the Efficiency Of Dcmimentioning
confidence: 99%
See 4 more Smart Citations
“…In this section, we provide an example that illustrates why the DCMI acceleration strategy is superior to current state-of-the-art FPGA-based accelerators [4,40]. We first explain the operation of the stencil compute kernel in Section 2.1.…”
Section: Explaining the Efficiency Of Dcmimentioning
confidence: 99%
“…The two main sources of parallelism in ISLs are spatial and temporal parallelism. Broadly, two main approaches have been proposed for exploiting both forms of parallelism in FPGA-based accelerators, and Streaming Time-Steps (SST) [4] and the Cone-based Architecture (CA) [40] are representative of these two strategies. In this section, we show that both SST and CA are inefficient, because they perform redundant computation and use OCM inefficiently.…”
Section: State-of-the-art Isl Acceleration Approaches: Sst and Camentioning
confidence: 99%
See 3 more Smart Citations