2020
DOI: 10.3390/computation8010020
|View full text |Cite
|
Sign up to set email alerts
|

ThunderX2 Performance and Energy-Efficiency for HPC Workloads

Abstract: In the last years, the energy efficiency of HPC systems is increasingly becoming of paramount importance for environmental, technical, and economical reasons. Several projects have investigated the use of different processors and accelerators in the quest of building systems able to achieve high energy efficiency levels for data centers and HPC installations. In this context, Arm CPU architecture has received a lot of attention given its wide use in low-power and energy-limited applications, but server grade p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 34 publications
0
7
0
Order By: Relevance
“…Certain errors could exist due to the accuracy of on-chip sensor for instantaneous readings reported in [57]. However, as our aim is to analyze only calculation energy cost, it is the only way for collecting on-chip component energy data [41], [58] and reported to be fairly accurate by NVIDIA (+-5% error rate [59]) and authors in [60], [61]. As the problem does not influence theoretical TOs/FLOPs, we will not discuss the accuracy of hardware energy monitoring.…”
Section: Methods Verificationmentioning
confidence: 99%
“…Certain errors could exist due to the accuracy of on-chip sensor for instantaneous readings reported in [57]. However, as our aim is to analyze only calculation energy cost, it is the only way for collecting on-chip component energy data [41], [58] and reported to be fairly accurate by NVIDIA (+-5% error rate [59]) and authors in [60], [61]. As the problem does not influence theoretical TOs/FLOPs, we will not discuss the accuracy of hardware energy monitoring.…”
Section: Methods Verificationmentioning
confidence: 99%
“…Calore et al measure power using tx2mon for computations running on a single ThunderX2 node to calculate energy efficiency of Lattice Boltzmann and Lattice quantum chromodynamics applications, but their experiments do not vary the frequency of the processor. 38 On the more memory bound of the two applications, they observe roughly half the energy usage along with superior performance using ThunderX2 compared to Intel Skylake. For the more compute bound of the two applications, they observe similar energy usage and about 20% less performance.…”
Section: Related Workmentioning
confidence: 99%
“…They obtained board level and processor hardware counter measurements during executions of the PARSEC and Splash‐2 benchmarks to show that energy costs of cache coherency are reasonable. Calore et al measure power using tx2mon for computations running on a single ThunderX2 node to calculate energy efficiency of Lattice Boltzmann and Lattice quantum chromodynamics applications, but their experiments do not vary the frequency of the processor 38 . On the more memory bound of the two applications, they observe roughly half the energy usage along with superior performance using ThunderX2 compared to Intel Skylake.…”
Section: Related Workmentioning
confidence: 99%
“…However, its short 128-bit vector size makes it less suitable for applications requiring intensive computation [17]. Despite this limitation, its high memory bandwidth makes it well-suited for memory-intensive applications [18].…”
Section: Arm-based Hpcmentioning
confidence: 99%