Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance

Ardalani, Newsha; Lestourgeon, Clint; Sankaralingam, Karthikeyan; Zhu, Xiaojin

doi:10.1145/2830772.2830780

Cited by 89 publications

(43 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Reference [7] uses both static attributes such as numbers of different types of instructions and dynamic attributes such as number of cache misses for regression. Reference [2] proposes cross-architecture performance prediction. It is a machine training based technique using both static and dynamic attributes from many programs from some/different benchmarks.…”

Section: Related Workmentioning

confidence: 99%

Comparing DSP Software Performance Prediction Models at Source Code Level — From Analytical to Statistical

Hu¹,

Liu²,

Su³

et al. 2019

JSW

View full text Add to dashboard Cite

Efficient performance prediction at source code level is essential in reducing the turnaround time of software development, particularly when the source code is subject to changes due to modification of problem specification. In this paper, we investigate and compare five performance prediction models from practical standpoint to determine the usefulness of these models. To verify the effectiveness of these models, we select a set of functions from PHY DSP Benchmark and TIC64 DSP processor for experiment. Comparing the predicted performance to the actual measured execution time, we observed that the relative prediction error generated from two of the five models are low and can thus be used for practical purposes.

show abstract

Section: Related Workmentioning

confidence: 99%

Comparing DSP Software Performance Prediction Models at Source Code Level — From Analytical to Statistical

Hu¹,

Liu²,

Su³

et al. 2019

JSW

View full text Add to dashboard Cite

show abstract

“…Ardalani et al also used machine learning to train GPU performance models [7]. Their modeling included two techniques: the forward feature selection stepwise regression and the bootstrap aggregating.…”

Section: A Pipeline Analysismentioning

confidence: 99%

A survey and measurement study of GPU DVFS on energy conservation

Mei

Wang

Chu

2017

Digital Communications and Networks

View full text Add to dashboard Cite

Energy efficiency has become one of the top design criteria for current computing systems. The dynamic voltage and frequency scaling (DVFS) has been widely adopted by laptop computers, servers, and mobile devices to conserve energy, while the GPU DVFS is still at a certain early age. This paper aims at exploring the impact of GPU DVFS on the application performance and power consumption, and furthermore, on energy conservation. We survey the state-of-the-art GPU DVFS characterizations, and then summarize recent research works on GPU power and performance models. We also conduct real GPU DVFS experiments on NVIDIA Fermi and Maxwell GPUs. According to our experimental results, GPU DVFS has significant potential for energy saving. The effect of scaling core voltage/frequency and memory voltage/frequency depends on not only the GPU architectures, but also the characteristic of GPU applications.

show abstract

“…Each slice is decomposed into three subgroups: (1) the Slice Common (Figure 8) which provides additional fixed function architectural units; (2) the Sub-Slice (Figure 9) which contains 24 Execution Units (EUs) and supporting execution hardware; and (3) an L3 cache. RastSim models only the portions of the Slice Common and Sub-Slice that are needed to provide functionally correct rendering.…”

Section: Slice Architecturementioning

confidence: 99%

GPU Performance Estimation using Software Rasterization and Machine Learning

O’Neal

Brisk

Abousamra

et al. 2017

ACM Trans. Embed. Comput. Syst.

View full text Add to dashboard Cite

This paper introduces a predictive modeling framework to estimate the performance of GPUs during presilicon design. Early-stage performance prediction is useful when simulation times impede development by rendering driver performance validation, API conformance testing and design space explorations infeasible. Our approach builds a Random Forest regression model to analyze DirectX 3D workload behavior when executed by a software rasterizer, which we have extended with a workload characterizer to collect further performance information via program counters. In addition to regression models, this work produces detailed feature rankings which can provide valuable architectural insight, and accurate performance estimates for an Intel integrated Skylake generation GPU. Our models achieve reasonable out-of-sample-error rates of 14%, with an average simulation speedup of 327x.

show abstract

Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance

Cited by 89 publications

References 44 publications

Comparing DSP Software Performance Prediction Models at Source Code Level — From Analytical to Statistical

Comparing DSP Software Performance Prediction Models at Source Code Level — From Analytical to Statistical

A survey and measurement study of GPU DVFS on energy conservation

GPU Performance Estimation using Software Rasterization and Machine Learning

Contact Info

Product

Resources

About