2009
DOI: 10.1147/jrd.2009.5388586
|View full text |Cite
|
Sign up to set email alerts
|

Design and microarchitecture of the IBM System z10 microprocessor

Abstract: The IBM System z10e microprocessor is currently the fastest running 64-bit CISC (complex instruction set computer) microprocessor. This microprocessor operates at 4.4 GHz and provides up to two times performance improvement compared with its predecessor, the System z9t microprocessor. In addition to its ultrahigh-frequency pipeline, the z10e microprocessor offers such performance enhancements as a sophisticated branch-prediction structure, a large second-level private cache, a data-prefetch engine, and a hardw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2009
2009
2015
2015

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 13 publications
0
9
0
Order By: Relevance
“…Starting with the POWER4 processor [12] and z10* [13] processors, the IBM I/O subsystem was built upon a proprietary ecosystem of I/O chips [14] attached to IBM's proprietary GX bus and later GX+ and GX++ buses to achieve higher I/O throughput than possible with the standardized interfaces available at the time. Different break-out I/O hub chips provided connectivity to PCI-X**, PCIe devices, and InfiniBand** or Ethernet fabrics.…”
Section: I/o Subsystemmentioning
confidence: 99%
“…Starting with the POWER4 processor [12] and z10* [13] processors, the IBM I/O subsystem was built upon a proprietary ecosystem of I/O chips [14] attached to IBM's proprietary GX bus and later GX+ and GX++ buses to achieve higher I/O throughput than possible with the standardized interfaces available at the time. Different break-out I/O hub chips provided connectivity to PCI-X**, PCIe devices, and InfiniBand** or Ethernet fabrics.…”
Section: I/o Subsystemmentioning
confidence: 99%
“…Figure 4 shows the pipeline stages of the z196 processor from the first decode stage (D1) to completion. In the prior z10 microprocessor pipeline design [2], the fixed-point unit (FXU) execution stage is placed after the cache data return stage of the LSU; hence, there is no need to stall the execution of an FXU instruction if it is dependent on a load data event. However, this caused results of the RR-type instructions 1 to go through many nonrequired stages (e.g., LSU pipe stages A0-A3), delaying the availability of results and eventually delaying the branch resolution cycle.…”
Section: Processor Microarchitecturementioning
confidence: 99%
“…Together, these features enable the platform to better support the traditional Large Systems Performance Reference (LSPR) workload set, to better utilize the hardware/software synergy that emerged on the preceding z10 platform, and to provide a highly competitive platform for emerging workloads while continuing the trend for increased box capacity within a constant energy footprint from one generation to the next. The sections that follow contain a comparative analysis between the z196 platform and its predecessor z10 platform [2,3].…”
Section: Introductionmentioning
confidence: 99%
“…Over several generations of the IBM z Systems, more parallelism has been achieved by adding processor cores, and by increasing instruction-level parallelism [1,2]. Multiple threads, capable of running independent instruction streams on one processor core, increase parallelism even further.…”
Section: Introductionmentioning
confidence: 99%