Proceedings of the 48th International Conference on Parallel Processing 2019
DOI: 10.1145/3337821.3337845
|View full text |Cite
|
Sign up to set email alerts
|

Gravitational Octree Code Performance Evaluation on Volta GPU

Abstract: In this study, the gravitational octree code originally optimized for the Fermi, Kepler, and Maxwell GPU architectures is adapted to the Volta architecture. The Volta architecture introduces independent thread scheduling requiring either the insertion of the explicit synchronizations at appropriate locations or the enforcement of the same implicit synchronizations as do the Pascal or earlier architectures by specifying -gencode arch=compute 60,code=sm 70. The performance measurements on Tesla V100, the current… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…To address this issue and mitigate the imbalance in computation and communication as much as possible, especially in scenarios involving largescale and long-duration phenomena, we adopted the dynamic load balancing (DLB) technique proposed by [47] in this study. The implementation was initially developed for the gravitational octree code GOTHIC [48,49] and adjusted for the MPM in this study. Domain decomposition in particle methods generally does not necessarily concern the geometry of each domain.…”
Section: Dynamic Load Balancing (Dlb) Techniquementioning
confidence: 99%
“…To address this issue and mitigate the imbalance in computation and communication as much as possible, especially in scenarios involving largescale and long-duration phenomena, we adopted the dynamic load balancing (DLB) technique proposed by [47] in this study. The implementation was initially developed for the gravitational octree code GOTHIC [48,49] and adjusted for the MPM in this study. Domain decomposition in particle methods generally does not necessarily concern the geometry of each domain.…”
Section: Dynamic Load Balancing (Dlb) Techniquementioning
confidence: 99%