Proceedings 2001 International Conference on Parallel Architectures and Compilation Techniques
DOI: 10.1109/pact.2001.953299
|View full text |Cite
|
Sign up to set email alerts
|

Cache-friendly implementations of transitive closure

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0
2

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 18 publications
(25 citation statements)
references
References 9 publications
0
23
0
2
Order By: Relevance
“…In this paper, to make our approach intuitive, instead of getting into intricate details of the definitions of dependency graphs, we use an example of a dependency graph to describe our ideas.As the properties used in the example are generic to dependency graphs and also through the case studies, it will be clear that our approach can be applied to any dependency graph. We chose the systolic array design for transitive closure( [2], [4]) as the example. So we briefly describe the problem and its systolic array design below.…”
Section: Dependency Graphsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this paper, to make our approach intuitive, instead of getting into intricate details of the definitions of dependency graphs, we use an example of a dependency graph to describe our ideas.As the properties used in the example are generic to dependency graphs and also through the case studies, it will be clear that our approach can be applied to any dependency graph. We chose the systolic array design for transitive closure( [2], [4]) as the example. So we briefly describe the problem and its systolic array design below.…”
Section: Dependency Graphsmentioning
confidence: 99%
“…Each macro node has a micro dependency graph (one macro node with micro nodes) inside it. While this abstraction some similarity to tiling ( [4]), this idea can be more easily extended to multiple levels of hierarchies, for new generation high performance computing and supercomputing systems with various levels of parallelism and compute power organization.…”
Section: Formalizing the Abstractionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, it is beneficial to find an estimate of the optimal tile size. A block size selection heuristic for finding this estimate is discussed in [34], and outlined here. • Use the 2:1 rule of thumb from [32] to adjust the cache size to that of an equivalent 4-way set associative cache.…”
Section: Inductionmentioning
confidence: 99%
“…It has been shown by a number of groups that data layouts tuned to the data access pattern of the algorithm can reduce both TLB and cache misses [7][31] [34]. In the case of the recursive algorithm, the access pattern is matched by a Z-Morton data layout.…”
Section: Data Layout Issuesmentioning
confidence: 99%