2000
DOI: 10.1016/s0167-8191(00)00040-5
|View full text |Cite
|
Sign up to set email alerts
|

Generating efficient tiled code for distributed memory machines

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2002
2002
2016
2016

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 38 publications
0
3
0
Order By: Relevance
“…Tang and Xue [19] presented a method for tiling SOR by applying skewing and tiling for distributed memory machines. Goumas et al [4] later continued this line of investigation by focusing on the parallelization of 2D iteration spaces that result from the discretization of PDEs.…”
Section: A Parallel Mlssor Methods For Gpgpusmentioning
confidence: 99%
“…Tang and Xue [19] presented a method for tiling SOR by applying skewing and tiling for distributed memory machines. Goumas et al [4] later continued this line of investigation by focusing on the parallelization of 2D iteration spaces that result from the discretization of PDEs.…”
Section: A Parallel Mlssor Methods For Gpgpusmentioning
confidence: 99%
“…Tang and Xue in [22] addressed the same issues for rectangularly tiled iteration spaces. We can also generate efficient data parallel code for non-rectangular tiles without imposing any further complexity.…”
Section: Data Parallel Code Generationmentioning
confidence: 98%
“…Loop tiling is beneficial for parallel computers with hierarchical memory: computers with both shared and distributed memory (Xue, 2012;Tang and Xue, 2000) as…”
Section: Introductionmentioning
confidence: 99%