2020 IEEE International Symposium on High Performance Computer Architecture (HPCA) 2020
DOI: 10.1109/hpca47549.2020.00040
|View full text |Cite
|
Sign up to set email alerts
|

Precise Runahead Execution

Abstract: Runahead execution improves processor performance by accurately prefetching long-latency memory accesses. When a long-latency load causes the instruction window to fill up and halt the pipeline, the processor enters runahead mode and keeps speculatively executing code to trigger accurate prefetches. A recent improvement tracks the chain of instructions that leads to the long-latency load, stores it in a runahead buffer, and executes only this chain during runahead execution, with the purpose of generating more… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 16 publications
(14 citation statements)
references
References 85 publications
0
14
0
Order By: Relevance
“…When the long-latency operation completes, the instructions are reinserted from the WIB into the issue queue. Runahead execution [14,[25][26][27] removes the blocking cache miss from the instruction window and continues to speculatively prefetch future memory addresses till the blocking miss returns. Continuous Flow Pipelines (CFP) [37] build on top of the CheckPoint and Renaming (CPR) proposal [1], releasing scheduler and register file resources for off-chip load-dependent instruction slices.…”
Section: Related Workmentioning
confidence: 99%
“…When the long-latency operation completes, the instructions are reinserted from the WIB into the issue queue. Runahead execution [14,[25][26][27] removes the blocking cache miss from the instruction window and continues to speculatively prefetch future memory addresses till the blocking miss returns. Continuous Flow Pipelines (CFP) [37] build on top of the CheckPoint and Renaming (CPR) proposal [1], releasing scheduler and register file resources for off-chip load-dependent instruction slices.…”
Section: Related Workmentioning
confidence: 99%
“…Precise Runahead Execution (PRE) [63,64], the state-ofthe-art in runahead execution, improves upon standard runahead through three key mechanisms. (1) PRE leverages the available back-end (issue queue and physical register file) resources to speculatively execute instructions in runahead mode, thereby eliminating the need to release and flush processor state when entering and exiting runahead mode.…”
Section: B Limitations Of Runahead Techniquesmentioning
confidence: 99%
“…To achieve the instruction-level visibility necessary to calculate the addresses of complex access patterns seen in today's workloads [3], we conclude that this ideal technique must operate within the core, instead of within the cache. Runahead execution [25,32,34,57,58,64] is the most promising technique to date, where upon a memory stall at the head of the reorder buffer (ROB), execution enters a speculative 'runahead' mode designed to prefetch future memory accesses. In runahead mode, the addresses of future memory accesses are calculated and the memory accesses are speculatively issued.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations