Proceedings of the 13th International Conference on Supercomputing 1999
DOI: 10.1145/305138.305175
|View full text |Cite
|
Sign up to set email alerts
|

Reducing branch misprediction penalties via dynamic control independence detection

Abstract: This paper presents the concept of dynamic control independence (DCl) and shows how it can be detected and exploited in an out-of-order superscalar processor to reduce the performance penalties of branch mispredictions. We show how DCI can be leveraged during branch misprediction recovery to reduce the number of instructions squashed on a misprediction as well as how it can be used to avoid predicting unpredictable branches by fetching instructions out-of-order A realistic implementation is described and evalu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
26
0

Year Published

2004
2004
2018
2018

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(29 citation statements)
references
References 7 publications
0
26
0
Order By: Relevance
“…Figure 6 shows the performance improvement of dynamichammock-predication, dual-path, multipath, and DMP over the baseline processor. The average IPC improvement over all benchmarks is 3.5% for dynamic-hammock-predication, 4.8% for dual-path, 8.8% for multipath, 11 and 19.3% for DMP. DMP improves the IPC by more than 20% on vpr (58%), mcf (47%), parser (26%), twolf (31%), compress (23%), and ijpeg (25%).…”
Section: Compiler Support For Diverge Branch and Cfm Point Selectionmentioning
confidence: 97%
See 1 more Smart Citation
“…Figure 6 shows the performance improvement of dynamichammock-predication, dual-path, multipath, and DMP over the baseline processor. The average IPC improvement over all benchmarks is 3.5% for dynamic-hammock-predication, 4.8% for dual-path, 8.8% for multipath, 11 and 19.3% for DMP. DMP improves the IPC by more than 20% on vpr (58%), mcf (47%), parser (26%), twolf (31%), compress (23%), and ijpeg (25%).…”
Section: Compiler Support For Diverge Branch and Cfm Point Selectionmentioning
confidence: 97%
“…Several hardware mechanisms were proposed to exploit control flow independence [35,36,11,8,16]. These techniques aim to avoid flushing the processor pipeline if the processor is known to be at a control-independent point in the program when a mispredicted branch is resolved.…”
Section: Related Work On Control Flow Independencementioning
confidence: 99%
“…Finally, techniques have been proposed to salvage some of the work performed on the incorrect control path via squash reuse [17], Control Independence [11,12,6], and Register Integration [13].…”
Section: Related Workmentioning
confidence: 99%
“…However, target prediction for indirect jumps is typically less accurate than conditional branch prediction, and may contribute a significant fraction of total control misspeculations. This heightens the importance of identifying reconvergence points for these instructions if it provides additional opportunity for techniques which mitigate misprediction costs [11,12,6,5].…”
Section: Survey Of Program Behaviormentioning
confidence: 99%
“…While such selective replay recovery mechanisms have been proposed [6], [10], most aggressive processors simply squash all instructions following a misspeculated branch due to the complexity of identifying control re-convergence and re-inserting instructions into the middle of the scheduling window.…”
Section: Fig 3 Comparison Of Parallelism Exploitable By Different Lmentioning
confidence: 99%