Proceedings. IEEE International Conference on Cluster Computing
DOI: 10.1109/clustr.2002.1137768
|View full text |Cite
|
Sign up to set email alerts
|

Compiling tiled iteration spaces for clusters

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 13 publications
0
8
0
Order By: Relevance
“…Each MPI node assumes the execution of a sequence of tiles, successive along the longest dimension of the original iteration space. The complete methodology is described more extensively in [5]. It must be noted that since our prime objective was to experimentally verify the performance benefits of the different parallelization models, for the sake of simplicity we resorted to hand-made parallelization, as opposed to automatic parallelization.…”
Section: Pure Mpi Paradigmmentioning
confidence: 99%
“…Each MPI node assumes the execution of a sequence of tiles, successive along the longest dimension of the original iteration space. The complete methodology is described more extensively in [5]. It must be noted that since our prime objective was to experimentally verify the performance benefits of the different parallelization models, for the sake of simplicity we resorted to hand-made parallelization, as opposed to automatic parallelization.…”
Section: Pure Mpi Paradigmmentioning
confidence: 99%
“…For the first two problems there is a skewed and an unskewed version [12], and for each version there are four (communication and scheduling) optimal tiling matrices (P1 − P4), calculated as described in [14] and [26]. The compilation efficiency of a method is measured by means of the compilation time, which increases as the number of row-operations performed increases.…”
Section: Measuring the Compilation Time And Performance Of Sequentialmentioning
confidence: 99%
“…In each case, we applied rectangular and non-rectangular tiling transformations. Although, as described in [12], non-rectangular tiling can be directly applied to the initial SOR and Jacobi code, in order to compare rectangular v.s. non-rectangular tiling, we apply them to the skewed version of their iteration space.…”
Section: Measuring the Performance Of Arbitrarily Tiled Parallel Codementioning
confidence: 99%
“…On the other hand in distributed memory computer systems, it is known that coarse-grain methods such as the ones presented in [10], [11], [12] offer better data locality and more efficient use of the memory than the fine-grain approaches. The use of coarse-grain methods can lead to performance improvements also in the case of embedded systems.…”
Section: Introductionmentioning
confidence: 99%