2012 IEEE 26th International Parallel and Distributed Processing Symposium 2012
DOI: 10.1109/ipdps.2012.128
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Potential Speedup of Serial Code via Lightweight Profiling and Emulations with Memory Performance Model

Abstract: We present Parallel Prophet, which projects potential parallel speedup from an annotated serial program before actual parallelization. Programmers want to see how much speedup could be obtained prior to investing time and effort to write parallel code. With Parallel Prophet, programmers simply insert annotations that describe the parallel behavior of the serial program. Parallel Prophet then uses lightweight interval profiling and dynamic emulations to predict potential performance benefit. Parallel Prophet mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…Kim et al and Wang et al modeled local memory bandwidth for multi-core processors [21,43]. Eklov et al characterized the performance impact of memory contention [44].…”
Section: Related Workmentioning
confidence: 99%
“…Kim et al and Wang et al modeled local memory bandwidth for multi-core processors [21,43]. Eklov et al characterized the performance impact of memory contention [44].…”
Section: Related Workmentioning
confidence: 99%
“…Kim et al propose an approach that predicts potential speedup from sequential execution [5]. Theoretical analysis of speedup of workloads on modern symmetric and asymmetric is provided in [33].…”
Section: Related Workmentioning
confidence: 99%
“…For example, a speedup model is usually used to quantify the benefits introduced by parallel computing in terms of execution time [5]. Higher concurrency levels, however, affect power dissipation (P ) because not only additional computing units are activated but also the power dissipation of common components on a chip will be shared by more cores.…”
Section: Introductionmentioning
confidence: 99%
“…The work by [10] is an early attempt to build a performance prediction model for a given CUDA program, whereas our prediction model is based on a sequential program. The work by [9] estimates potential speed-up using an annotated serial program. However, unlike ours, this work does not consider the data marshaling cost while calculating the speed-ups and the approach does not deal with meeting a target speed-up.…”
Section: Related Workmentioning
confidence: 99%