2011
DOI: 10.1007/978-3-642-23397-5_43
|View full text |Cite
|
Sign up to set email alerts
|

Automatic OpenCL Device Characterization: Guiding Optimized Kernel Design

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
4
3
2

Relationship

4
5

Authors

Journals

citations
Cited by 26 publications
(15 citation statements)
references
References 7 publications
0
15
0
Order By: Relevance
“…This is related to the weaker performance of the GPU (Ati Radeon HD5870) in mc1. Its VLIW architecture with very wide instruction width and high branch miss penalty would require specific fine-tuning of each code to perform well [33]. However, none of our test cases was tuned for a specific device.…”
Section: Analysis Of the Resultsmentioning
confidence: 99%
“…This is related to the weaker performance of the GPU (Ati Radeon HD5870) in mc1. Its VLIW architecture with very wide instruction width and high branch miss penalty would require specific fine-tuning of each code to perform well [33]. However, none of our test cases was tuned for a specific device.…”
Section: Analysis Of the Resultsmentioning
confidence: 99%
“…This is a result of the weaker performance of the GPU in mc1. The VLIW architecture with a very wide instruction width and high branch miss penalty would require specific fine-tuning of each code to perform well [7]. However, none of our test cases was tuned for a specific device.…”
Section: Discussionmentioning
confidence: 99%
“…In fact, the families of OpenCL benchmarks that support multiple devices that we know of, such as the SNU NPB suite [19], use this approach to exchange data between devices. Also, the benchmarks to characterize OpenCL [22] have never compared these two possibilities as far as we know. We have found, however, that clEnqueueCopyBuffer can be in fact much slower than the two sequenced transfers possibility in some systems.…”
Section: Implementation Detailsmentioning
confidence: 99%