2008
DOI: 10.2197/ipsjtrans.1.94
|View full text |Cite
|
Sign up to set email alerts
|

Two-Step Physical Register Deallocation for Data Prefetching and Address Pre-Calculation

Abstract: This paper proposes an instruction pre-execution scheme for a high performance processor, that reduces latency and early scheduling of loads. Our scheme exploits the difference between the amount of instruction-level parallelism available with an unlimited number of physical registers and that available with an actual number of physical registers. We introduce the two-step physical register deallocation scheme, which deallocates physical registers at the renaming stage as a first step, and eliminates pipeline … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
4
2

Relationship

5
1

Authors

Journals

citations
Cited by 8 publications
(19 citation statements)
references
References 41 publications
0
19
0
Order By: Relevance
“…However, several studies that do not need a multithreaded environment have been carried out more recently [20]- [24]. These studies commonly pre-execute instructions when the instruction window resources run out.…”
Section: Studies Considering Power On Mlp Exploitationmentioning
confidence: 99%
“…However, several studies that do not need a multithreaded environment have been carried out more recently [20]- [24]. These studies commonly pre-execute instructions when the instruction window resources run out.…”
Section: Studies Considering Power On Mlp Exploitationmentioning
confidence: 99%
“…Yamamoto et al proposed a scheme called two-step physical register deallocation (TSD), which allows preexecution for MLP exploitation [30], [31]. This scheme temporarily deallocates a physically register at the rename stage, and enables its reuse by another instruction temporarily.…”
Section: Exploiting Mlpmentioning
confidence: 99%
“…Although this scheme is feasible, it is still complicated. Furthermore, it imposes a considerable cycle count penalty due to register reallocation, completely eliminating any potential benefit from late register allocation [2].…”
Section: Reducing Register Filementioning
confidence: 99%
“…This type of register renaming is, for example, implemented in the MIPS R10000 [26] and the Digital Equipment Alpha 21264 [27]. In this section, we present the TSD scheme [1], [2] that forms the basis of our study. First we illustrate the effect of TSD, and then explain the scheme by describing both the basic TSD and its extension for pre-execution.…”
Section: Instruction Pre-execution Through Tsdmentioning
confidence: 99%
See 1 more Smart Citation