Proceedings of the Eighteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures 2006
DOI: 10.1145/1148109.1148117
|View full text |Cite
|
Sign up to set email alerts
|

A general approach for partitioning N-dimensional parallel nested loops with conditionals

Abstract: Parallel loops account for the greatest amount of parallelism in scientific and numerical codes. For example, most of the DO loops in SPEC CFP2000 and SPEC OMPM2001 are of DOALL type and account for a large percentage of the total execution time. One of the ways to exploit parallelism is to partition the iteration space of a DOALL loop amongst different processors in a parallel processor system. Naturally, a good partitioning is of key importance to achieve high performance and for efficient use of multiproces… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2008
2008
2014
2014

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 31 publications
(53 reference statements)
0
3
0
Order By: Relevance
“…For example, improving the parallelization of kernels with diverging branches (parts executed only by a subset of the work-items) is one of the low-hanging fruits. There is some previous work available that is targeted towards enhanced load-balancing which could be adapted to improving the fine-grained parallelization on machines with limited support for predication as well [32]. 1 7 .…”
Section: Discussionmentioning
confidence: 99%
“…For example, improving the parallelization of kernels with diverging branches (parts executed only by a subset of the work-items) is one of the low-hanging fruits. There is some previous work available that is targeted towards enhanced load-balancing which could be adapted to improving the fine-grained parallelization on machines with limited support for predication as well [32]. 1 7 .…”
Section: Discussionmentioning
confidence: 99%
“…Unlike [8], the non-perfect nature of the loop model is not restricted to conditionals. The loop model also supports multi-way loops [14], i.e., multiple loops may be present at the same level.…”
Section: The Approachmentioning
confidence: 99%
“…A significant amount of work has been done in the context of static partitioning of (parallel) loop nests with rectangular as well as non-rectangular iteration spaces [4,5,6,7,8,9,10]. However, the existing techniques are cache-oblivious, i.e., they do not capture the variation in the number of cache misses across the iteration space.…”
Section: Introductionmentioning
confidence: 99%