2005
DOI: 10.1007/s11227-005-0298-8

Hyperplane Grouping and Pipelined Schedules: How to Execute Tiled Loops Fast on Clusters of SMPs

Abstract: This paper proposes a novel approach for the parallel execution of tiled iteration spaces on a cluster of SMP PC nodes. Each SMP node has multiple CPUs and a single memory-mapped PCI-SCI Network Interface Card. We apply a hyperplane-based grouping transformation to the tiled space, so as to group together independent neighboring tiles and assign them to the same SMP node. In this way, intranode (intragroup) communication is eliminated. Groups are atomically executed inside each node. Nodes exchang…

Citations: cited by 6 publications (3 citation statements)
References: 46 publications (45 reference statements)
“…To compute the last tile coordinates, relation (2) can be applied to the last iteration point, UB = (ub_1, ub_2, . .…”
Section: Wavefront Transformation (mentioning)
confidence: 99%
“…An algorithm is given for computing the total execution time, i.e. the completion time, of a schedule. The algorithm assumes that the computation time of each tile and the communication time between dependent tiles are overlapped [2,8,14]. In addition, message passing between processors is performed in parallel, and processors start executing all the tiles on the same wavefront at the same time.…”
Section: Scheduling N-dimensional Tiles (mentioning)
confidence: 99%
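The assumptions in the statement above can be turned into a rough completion-time estimate. The sketch below is illustrative only and is not the algorithm from the citing paper. It assumes rectangular tiles, 0-based iteration indices, unit dependencies between neighboring tiles, a linear schedule in which a tile's wavefront is the sum of its coordinates, and enough processors to run a whole wavefront at once. Overlap of computation and communication is modeled as each step costing the larger of the two times. The names `last_tile`, `completion_time`, `t_comp` and `t_comm` are hypothetical.

```python
def last_tile(upper_bounds, tile_sizes):
    """Tile coordinates of the last iteration point UB.

    Assumption: rectangular tiling with 0-based indices, so the tile
    coordinate along each dimension is floor(ub_k / tile_size_k).
    """
    return tuple(ub // size for ub, size in zip(upper_bounds, tile_sizes))


def completion_time(upper_bounds, tile_sizes, t_comp, t_comm):
    """Estimated completion time of a wavefront schedule.

    Assumptions (not taken from the cited papers):
    - the wavefront of a tile is the sum of its coordinates,
    - all tiles on one wavefront start at the same time,
    - computation and communication overlap, so one step costs
      max(t_comp, t_comm).
    """
    last = last_tile(upper_bounds, tile_sizes)
    steps = sum(last) + 1  # wavefronts are numbered 0 .. sum(last)
    return steps * max(t_comp, t_comm)


if __name__ == "__main__":
    # 100 x 100 iteration space (indices 0..99) cut into 10 x 10 tiles.
    print(last_tile((99, 99), (10, 10)))
    print(completion_time((99, 99), (10, 10), t_comp=4.0, t_comm=1.5))
```

Under this model, shrinking the tiles raises the number of wavefront steps while lowering the per-step cost, which is the trade-off a tile-size choice has to balance.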
“…Athanasaki et al. describe another approach that also uses tiling to reduce communication for distributed-memory clusters [2]. In their approach, an additional tiling transformation is used for aggregating processor tiles along certain hyperplanes into so-called groups, which can be executed efficiently by exploiting the availability of communication-free shared-memory processors on each node in the cluster.…”
Section: Related Work (mentioning)
confidence: 99%