2013 International Conference on Parallel and Distributed Systems 2013
DOI: 10.1109/icpads.2013.48
|View full text |Cite
|
Sign up to set email alerts
|

Online Performance Projection for Clusters with Heterogeneous GPUs

Abstract: Abstract-We present a fully automated approach to project the relative performance of an OpenCL program over different GPUs. Performance projections can be made within a small amount of time, and the projection overhead stays relatively constant with the input data size. As a result, the technique can help runtime tools make dynamic decisions about which GPU would run faster for a given kernel. Usage cases of this technique include scheduling or migrating GPU workloads over a heterogeneous cluster with differe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2015
2015
2016
2016

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…To select the best device, we need to know only the kernel's relative performances and not necessarily the absolute kernel performances. Therefore, we create a technique called minikernel profiling, which is conceptually similar to our miniemulation technique [20], where we run just a single workgroup of the kernel on each participating device and collect the relative performances in the kernel profiles. Our approach dramatically reduces runtime overhead, as discussed in Section 6.…”
Section: Minikernel Profiling For Compute-intensive Kernelsmentioning
confidence: 99%
“…To select the best device, we need to know only the kernel's relative performances and not necessarily the absolute kernel performances. Therefore, we create a technique called minikernel profiling, which is conceptually similar to our miniemulation technique [20], where we run just a single workgroup of the kernel on each participating device and collect the relative performances in the kernel profiles. Our approach dramatically reduces runtime overhead, as discussed in Section 6.…”
Section: Minikernel Profiling For Compute-intensive Kernelsmentioning
confidence: 99%