Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2017
DOI: 10.1145/3126908.3126952
|View full text |Cite
|
Sign up to set email alerts
|

Embracing a new era of highly efficient and productive quantum Monte Carlo simulations

Abstract: QMCPACK has enabled cutting-edge materials research on supercomputers for over a decade. It scales nearly ideally but has low single-node e ciency due to the physics-based abstractions using array-of-structures objects, causing ine cient vectorization. We present a systematic approach to transform QMCPACK to better exploit the new hardware features of modern CPUs in portable and maintainable ways. We develop miniapps for fast prototyping and optimizations. We implement new containers in structure-of-arrays dat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 11 publications
0
7
0
Order By: Relevance
“…This also reduces the memory requirements of the application. We have recently completed extensive analysis and reimplementation of the core compute kernels of the application, more than doubling the speed of many calculations on modern multicore processors [37]. The performance obtained for several key kernels is shown in figure 2.…”
Section: Performance and Parallel Scalingmentioning
confidence: 99%
“…This also reduces the memory requirements of the application. We have recently completed extensive analysis and reimplementation of the core compute kernels of the application, more than doubling the speed of many calculations on modern multicore processors [37]. The performance obtained for several key kernels is shown in figure 2.…”
Section: Performance and Parallel Scalingmentioning
confidence: 99%
“…As the two functions can give slightly different numerical results, care must be taken to be sure to always have the same numerical results than the Rel. 4. Limit the use of Fortran intrinsics The use of Fortran intrinsics (SUM, PACK, etc...) is a common approach for vectorization in Molecular Dynamics package.…”
Section: Computational Hotspotsmentioning
confidence: 99%
“…In a recent study, Watanabe and Nakagawa [25] have obtained a boost factor between 1. 4 the data setup, for the vectorization of the Lennard-Jones potential on AVX2 and AVX-512 architectures. So, achieving comparable, and often superior, boost factors on all the vectorized routines of Tinker-HP seems quite satisfactory.…”
Section: Computational Hotspotsmentioning
confidence: 99%
See 2 more Smart Citations