Proceedings of International Workshop on Adaptive Self-Tuning Computing Systems 2014
DOI: 10.1145/2553062.2553067
|View full text |Cite
|
Sign up to set email alerts
|

Roofline-aware DVFS for GPUs

Abstract: DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
11
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 11 publications
1
11
0
Order By: Relevance
“…Jia et al [5] and Nugteren et al [6] proposed the GPU roofline and DVFS algorithm on the GPU. For easy construction of the roofline model, Steinmann [7] developed and distributed a tool that automatically records the kernel operation intensity on a x86 architecture.…”
mentioning
confidence: 99%
“…Jia et al [5] and Nugteren et al [6] proposed the GPU roofline and DVFS algorithm on the GPU. For easy construction of the roofline model, Steinmann [7] developed and distributed a tool that automatically records the kernel operation intensity on a x86 architecture.…”
mentioning
confidence: 99%
“…The commonly used approach is to either fix the DDR frequency at the highest level, or to map unique pairs of CPU/GPU and DDR frequencies, selected statically (i.e, at design time) to a DVFS level (this is the approach QoS-aware DVFS utilizes). To the best of our knowledge, there are only two previous studies that considered DDR frequency scaling for GPGPU applications [66,68]. Their solutions do not fully consider the interrelationship between the GPU and DDR frequencies for maximum efficiency; also their work is not directly applicable to 3D graphics workloads.…”
Section: Discussionmentioning
confidence: 99%
“…It is notable that variations of performance and/or power models have been used in CPU DVFS techniques before [62,64,13]. Performance [66,68] and power [67] models have also been developed for GPGPU power management Recently, Pathania et al described statistical models for mobile GPU gaming workloads [16]. These existing solutions do not sufficiently satisfy one or more of the following requirements: (1) Low-latency, fine-grained, individual frame-level models for both performance and energy prediction that are required for graphics rendering workloads with soft-realtime deadlines.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations