1994
DOI: 10.1006/jpdc.1994.1104
|View full text |Cite
|
Sign up to set email alerts
|

Communication Optimizations for Irregular Scientific Computations on Distributed Memory Architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
76
0

Year Published

1995
1995
2016
2016

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 166 publications
(76 citation statements)
references
References 2 publications
0
76
0
Order By: Relevance
“…Previous work on inspectors in the context of languages such as HPF and Titanium [8,1,14] have suggested reordering loop iterations to differentiate local and non-local accesses. Consider a common sparse matrix-vector product shown in Figure 2, taken from the NAS Conjugate Gradient benchmark.…”
Section: Figure 4 Restructuring Of Sparse Matrixvector Multiplicatiomentioning
confidence: 99%
See 1 more Smart Citation
“…Previous work on inspectors in the context of languages such as HPF and Titanium [8,1,14] have suggested reordering loop iterations to differentiate local and non-local accesses. Consider a common sparse matrix-vector product shown in Figure 2, taken from the NAS Conjugate Gradient benchmark.…”
Section: Figure 4 Restructuring Of Sparse Matrixvector Multiplicatiomentioning
confidence: 99%
“…This execution scheme could also add significant overhead to serial sections, as these needed be executed on multiple nodes owning parts of the data. Handling irregular data was difficult and usually employed runtime schemes [8]. In contrast to HPF implementations, our execution model starts from the available parallelism specified through OpenMP directives.…”
Section: Related Workmentioning
confidence: 99%
“…Pingali et al manually regrouped work in integer computations, including tree traversals [17]. Dynamic regrouping is also part of the inspector-executor model, originally studied by Saltz and others for partitioning irregular computations on distributed-memory parallel machines [5]. Most of these studies considered data transformation in conjunction with computation reordering.…”
Section: Related Workmentioning
confidence: 99%
“…Initially, such transformations were incorporated into applications manually for parallelism [15]. Next, libraries with run-time transformation primitives were developed so that a programmer or compiler could insert calls to such primitives [16,55]. Currently, there are run-time reordering transformations for which a compiler can automatically analyze and generate the inspectors [74,19,40,26].…”
Section: Introductionmentioning
confidence: 99%