1996
DOI: 10.1007/bfb0014213

Automatic selection of dynamic data partitioning schemes for distributed-memory multicomputers

citations
Cited by 29 publications
(13 citation statements)
references
References 24 publications
“…Previous work on parallelism has concentrated, among other topics, on compilation techniques for multicomputers [5,8,51,24], for multiprocessors [47,7], and for automatic discovery of parallelism [21,48,39,18,36,26]. Since neither data layout transformations nor cache locality was the central issue in any of these papers, we do not discuss them here any further.…”
Section: Related Work On Parallelism
mentioning
confidence: 97%
“…Some of the proposals presented in the literature about array remapping [:'i. [13][14][15][16][17][18][19][20] are summarized in the rest of this section.…”
Section: Introduction
mentioning
confidence: 99%
“…and remapping costs into account. Palermo and Banerjee [20] also use a divide-and-conquer approach in which the program is recursively decomposed into a hierarchy of candidate phases. Then.…”
Section: Introduction
mentioning
confidence: 99%
“…Currently, the compiler can automatically select a static distribution of data (using a constraint-based algorithm [16]) specifying both the configuration of an abstract multi-dimensional mesh topology along with how program data should be distributed on the mesh. We have previously presented a technique [27] which extends the static partitioning algorithm to select dynamic data distributions which can further improve the performance of the resulting parallel program. In this paper we describe the implementation in more detail, introduce the idea of the hierarchical component affinity graph for performing interprocedural data partitioning, as well as present experimental results obtained using the current implementation.…”
mentioning
confidence: 99%