2011 IEEE International Solid-State Circuits Conference 2011
DOI: 10.1109/isscc.2011.5746228
|View full text |Cite
|
Sign up to set email alerts
|

40-Entry unified out-of-order scheduler and integer execution unit for the AMD Bulldozer x86–64 core

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 19 publications
(11 citation statements)
references
References 3 publications
0
11
0
Order By: Relevance
“…These buses are {2 (read)+1 (write)}×5 (unit) = 15 tracks of 8-bit wire bundles. These wires are routed in an upper layer, whose pitch is twice as thick as that in the lower layers used for RAMs [11] and switches. Thus, 15 × 2 = 30 tracks can be used within this width for the RAMs and switches.…”
Section: -Bit Slicementioning
confidence: 99%
“…These buses are {2 (read)+1 (write)}×5 (unit) = 15 tracks of 8-bit wire bundles. These wires are routed in an upper layer, whose pitch is twice as thick as that in the lower layers used for RAMs [11] and switches. Thus, 15 × 2 = 30 tracks can be used within this width for the RAMs and switches.…”
Section: -Bit Slicementioning
confidence: 99%
“…Yet, because we do not implement the whole NoSQ infrastructure and since we did not tune the Distance predictor to handle memory dependency predictions well, we assume the presence of enough sources in the scheduler, which is optimistic but not unrealistic. For instance, AMD Bulldozer's scheduler supports 4 sources per instruction [17], which would be enough for both additional dependencies.…”
Section: Validationmentioning
confidence: 99%
“…This is roughly two times fewer entries than Haswell's scheduler. Assuming a baseline CAM-like scheduler and 6/8 results per cycle, then each entry of the scheduler must provision 12/16 comparators for wakeup, assuming 2 operands per entry (consider that AMD Bulldozer's actually has up to 4 per entry [14]). The speculative window only requires as many comparators per entry as there are blocks fetched per cycle (granted that the comparators are bigger since we match 15 bits instead of [6][7][8].…”
Section: A Remark On Power and Energymentioning
confidence: 99%