2015
DOI: 10.1007/s11265-015-1051-z
|View full text |Cite
|
Sign up to set email alerts
|

OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures

Abstract: The proliferation of heterogeneous computing platforms presents the parallel computing community with new challenges. One such challenge entails evaluating the efficacy of such parallel architectures and identifying the architectural innovations that ultimately benefit applications. To address this challenge, we need benchmarks that capture the execution patterns (i.e., dwarfs or motifs) of applications, both present and future, in order to guide future hardware design. Furthermore, we desire a common programm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 30 publications
(16 citation statements)
references
References 16 publications
0
16
0
Order By: Relevance
“…However, work-items accessing data along a column of the matrix do not observe memory access coalescing. us we observe that while cache hit rates are typically low on this benchmark, particularly on GPUs [12], GPUs can hide the latency of global memory accesses through memory access coalescing to some extent.…”
Section: Structured Grids: Speckle Reducing Anisotropicmentioning
confidence: 81%
See 3 more Smart Citations
“…However, work-items accessing data along a column of the matrix do not observe memory access coalescing. us we observe that while cache hit rates are typically low on this benchmark, particularly on GPUs [12], GPUs can hide the latency of global memory accesses through memory access coalescing to some extent.…”
Section: Structured Grids: Speckle Reducing Anisotropicmentioning
confidence: 81%
“…is indicates that almost all memory accesses made by the kernel are perfectly synchronized between OpenCL threads. Performance results [9,12] show that GEM performs signi cantly be er on GPUs than on CPUs, as memory unit stalls are at low levels for both CPUs and GPUs due to the highly e cient memory utilization of this benchmark. As memory operations do not present a bo leneck, this benchmark is able to take advantage of the superior oating-point compute capability of GPUs [12].…”
Section: N-body Methods: Gemmentioning
confidence: 99%
See 2 more Smart Citations
“…In this paper, we use the OpenDwarfs benchmark suite [3], a suite of architecture-agnostic OpenCL kernels that capture common computation and communication patterns across a wide spectrum of scientific and engineering applications, to study the performance of the OpenCL programming model on FPGAs. In OpenDwarfs, none of the dwarfs contain optimizations that favor a specific architecture over another.…”
Section: Introductionmentioning
confidence: 99%