14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05) 2005
DOI: 10.1109/pact.2005.23
|View full text |Cite
|
Sign up to set email alerts
|

Future execution: a hardware prefetching technique for chip multiprocessors

Abstract: This paper proposes a new hardware technique for using one core of a CMP to prefetch data for a thread running on another core. Our approach simply executes a copy of all non-control instructions in the prefetching core after they have executed in the primary core. On the way to the second core, each instruction's output is replaced by a prediction of the likely output that the n th future instance of this instruction will produce. Speculatively executing the resulting instruction stream on the second core iss… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
47
0

Year Published

2006
2006
2014
2014

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 46 publications
(47 citation statements)
references
References 31 publications
0
47
0
Order By: Relevance
“…Some polynomial formula is used to predict the future access for general cases. For example, if a constant difference is found in the third depth, the future access is predicted as Figure 3 shows an example where a complex structure pattern (4,8,4,8) is detected when we perform the MLDT prefetching with the DAHC.…”
Section: Aggressive Prefetching Strategiesmentioning
confidence: 99%
See 1 more Smart Citation
“…Some polynomial formula is used to predict the future access for general cases. For example, if a constant difference is found in the third depth, the future access is predicted as Figure 3 shows an example where a complex structure pattern (4,8,4,8) is detected when we perform the MLDT prefetching with the DAHC.…”
Section: Aggressive Prefetching Strategiesmentioning
confidence: 99%
“…Other recent efforts in hardware prefetching include Zhou's dual-core execution (DCE) approach [23] , Ganusov et al's future execution (FE) approach [8] , Sun et al's data push server architecture [21] and Solihin et al's memory-side prefetching [20] .…”
Section: Related Workmentioning
confidence: 99%
“…Next, input matching starts. This process is shown as (1)... (8) in Fig.4. First, the memoization engine reads the value of program counter (PC) and registers.…”
Section: B Input Matchingmentioning
confidence: 99%
“…This extension can omit the execution of instruction regions whose inputs show monotonous increase/decrease. These SpMT cores not only omit some execution, but also works as a cache prefetch technique [7], [8].…”
Section: Parallel Speculative Executionmentioning
confidence: 99%
“…Many of these methods run a helper thread ahead of actual execution of an application to predict cache misses [10−15] . Another set of methods employ run-ahead execution at hardware level [16,17] , where idle or dedicated cycles are used for prefetching. We proposed to utilize a dedicated server to push data closer to CPU by selecting future data access prediction methods dynamically [18] .…”
Section: Introductionmentioning
confidence: 99%