ACM/IEEE SC 2005 Conference (SC'05)
DOI: 10.1109/sc.2005.35

Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions

Abstract: A very challenging issue for optimizing compilers is the phase ordering problem: In what order should a collection of compiler optimizations be performed? We address this problem in the context of optimizing a sequence of tensor contractions. The pertinent loop transformations are loop permutation, tiling, and fusion; in addition, the placement of disk I/O statements crucially affects performance. The space of possible combinations is exponentially large. We develop novel pruning strategies whereby a search pr…

Cited by 9 publications (10 citation statements)
References 40 publications

“…The key decisions to be made are 1) determining the loop structure, comprising fused and tiled loops, 2) the placement of memory allocation and data movement statements for each of the memory hierarchy levels, and 3) the tile sizes. It has been shown previously that the space of possible choices is exponentially large [33]. Besides considering additional levels in the memory hierarchy in determining fusion structures, we focus on the prescriptive approaches to choosing candidate loop fusions without requiring expensive optimization procedures.…”
Section: Problem Statement and Notation. Citation type: mentioning
confidence: 99%
“…from disks to main memory). A detailed description is available from an earlier publication [33]. In a tensor contraction, the indices can be grouped into those that are contracted and the remaining indices in the two input tensors.…”
Section: Single Contraction Optimization. Citation type: mentioning
confidence: 99%
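
The index grouping described in the statement above can be illustrated with a minimal sketch. It is not taken from the cited paper: the function name `group_indices` and the single-letter index labels are hypothetical, and it assumes the common convention that an index appearing in both input tensors is the one summed over.

```python
def group_indices(a_indices, b_indices):
    """Group the indices of a binary contraction C = sum(A * B).

    Assumption (not from the source): an index present in both inputs
    is contracted; every other index is external and survives in C.
    """
    a_set = set(a_indices)
    b_set = set(b_indices)
    # Indices shared by both inputs are summed over.
    contracted = [i for i in a_indices if i in b_set]
    # Indices unique to one input remain in the result.
    a_external = [i for i in a_indices if i not in b_set]
    b_external = [i for i in b_indices if i not in a_set]
    return contracted, a_external, b_external


# Example: C[i,j,m,n] = sum over k,l of A[i,j,k,l] * B[k,l,m,n]
contracted, a_ext, b_ext = group_indices("ijkl", "klmn")
print(contracted, a_ext, b_ext)
```

Under this convention the three groups play the same roles as the summation, row, and column indices of a matrix multiplication, which is one reason loop permutation and tiling choices for a contraction are often reasoned about per group rather than per index.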