2013
DOI: 10.1002/cpe.3046
|View full text |Cite
|
Sign up to set email alerts
|

CPU–GPU hybrid parallel strategy for cosmological simulations

Abstract: SUMMARYGadget is a simulation application for N‐body and smoothed particle hydrodynamics problems in cosmology, and it is widely applied in solving series of cosmological problems. N‐body focuses on the motion of the interaction of N particles, and smoothed particle hydrodynamics is a fluid simulation algorithm that studies the movement of fluid through particle simulation. Most scholars focus their attention on accelerating Gadget on multi‐core CPU or graphics processing units (GPUs) platforms. However, these… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
13
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(14 citation statements)
references
References 19 publications
1
13
0
Order By: Relevance
“…The increase is 40.1 times (92 days was reduced to 2 days 7 hours) and it shows the increase change with the problem scale [20] . Some practical conclusions can be drawn under the simulations of different parameters.…”
Section: Simulation Results and Discussionmentioning
confidence: 93%
“…The increase is 40.1 times (92 days was reduced to 2 days 7 hours) and it shows the increase change with the problem scale [20] . Some practical conclusions can be drawn under the simulations of different parameters.…”
Section: Simulation Results and Discussionmentioning
confidence: 93%
“…The further optimization of the performance of communication between threads in shared memory mode such as GPGPU architecture 19 will be our next work. The further optimization of the performance of communication between threads in shared memory mode such as GPGPU architecture 19 will be our next work.…”
Section: Conclusion and Further Workmentioning
confidence: 99%
“…15 A hybrid MPI/OpenMP approach is applied to parallel geo-computation using multi-core computers for two-level parallelization to implement process-level and thread-level parallelization for the higher speed-up ratio. 19 An efficient modified R3 algorithm is designed, which executed on the GPU by a two-level spatial domain decomposition strategy. 8,17,18 The parallel computing technique based on the GPU is utilized to perform viewshed analysis more efficiently in some cases.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…CUBLAS used both CPUs and GPUs to compute different parts of a dense matrix. Wang extended the cosmology simulation package Gadget2 with the CPU–GPU hybrid parallel strategy. The short‐range force calculation was split between CPUs and GPU, while communication and the hydrodynamics computation were performed on CPUs.…”
Section: Introductionmentioning
confidence: 99%