2012 International Conference for High Performance Computing, Networking, Storage and Analysis 2012
DOI: 10.1109/sc.2012.97
|View full text |Cite
|
Sign up to set email alerts
|

Portable section-level tuning of compiler parallelized applications

Abstract: Automatic parallelization of sequential programs combined with tuning is an alternative to manual parallelization. This method has the potential to substantially increase productivity and is thus of critical importance for exploiting the increased computational power of today's multicores. A key difficulty is that parallelizing compilers are generally unable to estimate the performance impact of an optimization on a whole program or a program section at compile time; hence, the ultimate performance decision to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(10 citation statements)
references
References 31 publications
0
10
0
Order By: Relevance
“…Profile-based optimization lies between program-level and section-level tuning [3]. It is similar to section-level tuning, with a p-window size of 1(discussed in section B.2), and a constant parallelization threshold for all loops in the program.…”
Section: ) Profile-based Optimizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Profile-based optimization lies between program-level and section-level tuning [3]. It is similar to section-level tuning, with a p-window size of 1(discussed in section B.2), and a constant parallelization threshold for all loops in the program.…”
Section: ) Profile-based Optimizationmentioning
confidence: 99%
“…An automatic tuning system is needed to tune program performance quickly in an exponential search space, and to achieve high program performance. In previous work [3], we introduced a novel, fast tuning algorithm that is able to optimize individual program sections.…”
Section: Introductionmentioning
confidence: 99%
“…Concerning autotuning, there are several offline approaches focusing the tuning of OpenMP parameters [30,31,32], which may also be knobs on the ANTAREX approach. There are also frameworks focused on more general, application-level autotuning [33,34,35].…”
Section: Related Workmentioning
confidence: 99%
“…For a shared memory system, code with OpenMP directives is to be generated. The C file generated by the CI can be further optimized using existing autotuners [5,27,28]. …”
Section: Code Instrumentormentioning
confidence: 99%