Proceedings of the 1991 ACM/IEEE Conference on Supercomputing - Supercomputing '91 1991
DOI: 10.1145/125826.125932
|View full text |Cite
|
Sign up to set email alerts
|

An effective on-chip preloading scheme to reduce data access penalty

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
191
0
4

Year Published

2000
2000
2007
2007

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 334 publications
(196 citation statements)
references
References 10 publications
1
191
0
4
Order By: Relevance
“…Figure 2 shows the cumulative distribution of L1 block dead-times-the number of cycles between a last touch to a block until its eventual eviction. 1 The figure corroborates prior results [12,13,26], showing that over 85% of all cache-block dead-times are longer than the memory access latency. Therefore, prefetch requests issued at last touch can complete before the next access to the same cache index, eliminating the entire off-chip miss latency.…”
Section: Background: Dbcp Prefetchingsupporting
confidence: 88%
See 2 more Smart Citations
“…Figure 2 shows the cumulative distribution of L1 block dead-times-the number of cycles between a last touch to a block until its eventual eviction. 1 The figure corroborates prior results [12,13,26], showing that over 85% of all cache-block dead-times are longer than the memory access latency. Therefore, prefetch requests issued at last touch can complete before the next access to the same cache index, eliminating the entire off-chip miss latency.…”
Section: Background: Dbcp Prefetchingsupporting
confidence: 88%
“…Temporal correlation distance between two consecutive misses is the distance between the previous occurrence of the same two misses in the sequence of all cache misses. 1 A temporal correlation distance of +1 implies perfect correlation, where the two most recent occurrences of a pair of consecutive misses appeared in exactly the same order; a distance of -1 corresponds to a reversal of the two misses compared to their previous occurrence, as in the sequence {A,B,...,B,A}. Figure 6 (left) shows absolute temporal correlation distances as a cumulative distribution of all cache misses.…”
Section: Temporal Correlation Opportunitymentioning
confidence: 99%
See 1 more Smart Citation
“…These techniques may also require the quantification of the different types of cache misses, traditionally noted as compulsory, capacity and conflict misses. For instance, compulsory misses can be avoided through prefetching, both hardware [2] and software [6]. Blocking or tiling is used to avoid capacity misses [3], whereas some examples of techniques to reduce the effect of conflict misses are copying [11] and padding [7].…”
Section: Introductionmentioning
confidence: 99%
“…RAMBUS and synchronous DRAM's utilize a form of on-chip caching and even more aggressive approaches have been proposed 3]. Latency hiding techniques, primarily prefetching 6,7,8], also have been utilized to help solve the problem.…”
Section: Introductionmentioning
confidence: 99%