2010
DOI: 10.1109/tvlsi.2008.2012128
|View full text |Cite
|
Sign up to set email alerts
|

On the Latency and Energy of Checkpointed Superscalar Register Alias Tables

Abstract: Abstract-This paper investigates how the latency and energy of register alias tables (RATs) vary as a function of the number of global checkpoints (GCs), processor issue width, and window size. It improves upon previous RAT checkpointing work that ignored the actual latency and energy tradeoffs and focused solely on evaluating performance in terms of instructions per cycle (IPC). This work utilizes measurements from the full-custom checkpointed RAT implementations developed in a commercial 130-nm fabrication t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2011
2011
2016
2016

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 30 publications
(37 reference statements)
0
11
0
Order By: Relevance
“…We did not evaluate the energy consumption of the comparators for dependency checking on register renaming [11], [13], but each comparator consists of a few transistors, and thus, we considered their energy consumption to be comparatively small. Moreover, our proposed method omits this dependency checking on an RTC hit, which results in a conservative energy comparison for our technique.…”
Section: Evaluation Environmentmentioning
confidence: 99%
See 2 more Smart Citations
“…We did not evaluate the energy consumption of the comparators for dependency checking on register renaming [11], [13], but each comparator consists of a few transistors, and thus, we considered their energy consumption to be comparatively small. Moreover, our proposed method omits this dependency checking on an RTC hit, which results in a conservative energy comparison for our technique.…”
Section: Evaluation Environmentmentioning
confidence: 99%
“…An RMT generally requires one write and three read ports per single 2-source operand instruction [11], [13]: 1) one write port for updating new destination mapping, 2) two read port for reading source operand mapping, and 3) one read port for reading old destination mapping. As a result, each instruction with 2-source operand requires four ports.…”
Section: Evaluated Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…For general ISAs with a three-operand format, an RMT requires 4 ports per instruction [17], [24]. For example, in a processor whose decode width is 8, such as IBM Power 8 [7], [2], a straightforward implementation of an RMT requires 32 ports 2 .…”
Section: A Problems Of Register Mapmentioning
confidence: 99%
“…We evaluated the energy consumption for register renaming by evaluating the arrays and their peripheral circuits related to our proposal, which are the RMT, TC, RTC and instruction cache. We do not evaluate the energy consumption of the comparators for dependency checking on register renaming [17], [24], but each comparator consists of few transistors, and thus, we think that their energy consumption is comparatively small. Moreover, our proposal omits this dependency checking on RTC hit, and consequently, this results in a conservative energy comparison for our technique.…”
Section: A Evaluation Environmentmentioning
confidence: 99%