Proceedings of the 2nd International Workshop on Hardware-Software Co-Design for High Performance Computing 2015
DOI: 10.1145/2834899.2834905
|View full text |Cite
|
Sign up to set email alerts
|

Performance and energy efficiency analysis of 64-bit ARM using GAMESS

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3

Relationship

3
4

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 25 publications
0
8
0
Order By: Relevance
“…The BNCH bin, which consists of counters that describe interaction of applications with the branch unit, ranks in the top three in both architectures. To understand why the branch unit is a bottleneck, we created a benchmark, similar to one created in [31], that loops around a branch which is controlled by an array of booleans. We initialize the size of the array to 32K booleans, and run the benchmark inside another loop 16K times.…”
Section: Performance Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The BNCH bin, which consists of counters that describe interaction of applications with the branch unit, ranks in the top three in both architectures. To understand why the branch unit is a bottleneck, we created a benchmark, similar to one created in [31], that loops around a branch which is controlled by an array of booleans. We initialize the size of the array to 32K booleans, and run the benchmark inside another loop 16K times.…”
Section: Performance Resultsmentioning
confidence: 99%
“…Given the recent attention of ARM in the server and scientific communities, there has been a growth in research on ARM's power and performance capabilities [16,26,27,28,29,31].…”
Section: Related Workmentioning
confidence: 99%
“…Where possible the results from LMBench were confirmed using published specifications. Core and system memory and floating point throughput were measured using the CS Roofline Toolkit was used [13]. Measurements were taken for all cores and a single core across all systems to allow for comparison across the various architecture implementations.…”
Section: Methodsmentioning
confidence: 99%
“…This was subsequently used by Cavium in their ThunderX product that can be configured with a 48core ARMv8 processor. Some early quantum chemistry related performance results for this system were somewhat disappointing, indicating that this system is behind the performance of contemporary x86 processors. , …”
Section: Emerging Hardware Trendsmentioning
confidence: 99%