1991
DOI: 10.1145/106975.106981
|View full text |Cite
|
Sign up to set email alerts
|

The cache performance and optimizations of blocked algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

2
207
0

Year Published

1997
1997
2017
2017

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 134 publications
(209 citation statements)
references
References 6 publications
2
207
0
Order By: Relevance
“…Conflict misses [20] may occur when too many data items map to the same set of cache locations, causing cache lines to be flushed from cache before they may be used, despite sufficient capacity in the overall cache. As a result, in addition to eliminating capacity misses [11], [23] and maximizing cache utilization, the tile should be selected in such a way that there are no (or few) self conflict misses, while cross conflict misses are minimized [3], [4], [5], [10], [17].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Conflict misses [20] may occur when too many data items map to the same set of cache locations, causing cache lines to be flushed from cache before they may be used, despite sufficient capacity in the overall cache. As a result, in addition to eliminating capacity misses [11], [23] and maximizing cache utilization, the tile should be selected in such a way that there are no (or few) self conflict misses, while cross conflict misses are minimized [3], [4], [5], [10], [17].…”
Section: Related Workmentioning
confidence: 99%
“…To model self conflict misses due to low associativity cache, [24] and [12] use the effective cache size q × C (q < 1), instead of the actual cache size C, while [3], [4], [10] and [19] explicitly find the non-conflicting tile sizes. Taking into account cache line size as well, column dimensions (without loss of generality, assume a column major data array layout) should be a multiple of the cache line size [4].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Wolf et al [34] consider the integrated treatment of fusion and tiling only from the point of view of enhancing locality and do not consider the impact of the amount of required memory; the memory requirement is a key issue for the problems considered in this paper. Loop tiling for enhancing data locality has been studied extensively [27,33,30], and analytic models of the impact of tiling on locality have been developed [7,20,25]. Recently, a data-centric version of tiling called data shackling has been developed [12,13] (together with more recent work by Ahmed et al [1]) which allows a cleaner treatment of locality enhancement in imperfectly nested loops.…”
Section: Related Workmentioning
confidence: 99%