2021
DOI: 10.1109/mm.2021.3058632
|View full text |Cite
|
Sign up to set email alerts
|

IBM's POWER10 Processor

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(15 citation statements)
references
References 5 publications
0
15
0
Order By: Relevance
“…System Setup. Our POWER10 system has one single-chip module (SCM) with 15 SMT8 cores, equivalent to 30 SMT4 cores [41]. Each SMT4 core has 32 KB private L1 and 1MB private L2 caches.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…System Setup. Our POWER10 system has one single-chip module (SCM) with 15 SMT8 cores, equivalent to 30 SMT4 cores [41]. Each SMT4 core has 32 KB private L1 and 1MB private L2 caches.…”
Section: Methodsmentioning
confidence: 99%
“…These specialized units can execute dense matrix-multiply operations on small matrices (e.g., blocks of size 4 × 4). Examples of these are NVIDIA's Tensor Cores [2,5], IBM's POWER10 Matrix-Multiply Assist (MMA) facilities [12,41], and Intel's AMX [6]. These units are successfully utilized to maximize performance for dense matrix operations [2,5,32].…”
Section: Introductionmentioning
confidence: 99%
“…In addition, custom hardware for memory sharing relies on emerging open standards, such as Open Memory Interface (OMI) and Compute Express Link (CXL). IBM's Power10 has a memory clustering feature called memory inception, which allows one PM to map its address space to the physical memory of another PM by leveraging OMI memory [37]. It allows a PM to run large memory workloads that go beyond its capacity by borrowing memory from other PMs.…”
Section: Commercial Alternative Solutionsmentioning
confidence: 99%
“…This is inline with the throughput of the two matrix pipelines being double that of the four vector pipes. For more performance results on both HPL and ResNet-50 (also 4× the per core performance of POWER9), we refer the reader to [21]. We take a closer look at the performance of the 128 × 128 DGEMM kernel in Figure 11.…”
Section: Performance Measurementsmentioning
confidence: 99%
“…The IBM POWER10 processor [20], [21] is the compute engine for the next generation of Power Systems and successor to the current POWER9 [16], [19] processor. As such, it has to offer superior performance on applications of interest to Power Systems users.…”
Section: Introductionmentioning
confidence: 99%