Circular Shift of Loop Body - Programme Transformation, Promoting Parallelism

Steinberg, Oleg

doi:10.14529/mmp170310

Cited by 6 publications

(2 citation statements)

References 3 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A detailed description of the numerical algorithms SPH and N-body is given in papers [28,29]. In the papers [30,31] it was also noted that the result of the parallel software implementation and its efficiency depends on the details of the code and on the sequence of the numerical operations. Fig.…”

Section: Numerical Code and Parametersmentioning

confidence: 99%

Numerical Modelling of the Dynamics of the Galactic Halos in the Colliding Galaxies

Khrapov¹,

Хоперсков²,

Корчагин³

2019

Bulletin SUSU MMP

View full text Add to dashboard Cite

Based on parallel three-dimensional simulation of N-body and gas self-consistent dynamics, we study the behavior of hot coronal gas in the colliding galaxies with "live" dark matter halos. We model a few scenarios of the galactic collisions including "bull-eye" and non-central ones, and use different values of the initial velocities of the colliding galaxies. Taking into account the self-gravity, we demonstrate that the collision of gaseous and stellar components does not lead to the formation of a gaseous "protogalaxy" observed in some numerical simulations. Also, we show that about sixty percent of hot halo gas is expelled into intergalactic space during the collision. Numerical simulations show that a considerable amount of gas (up to 70% for a bull-eye collisions) exchanges between two colliding galaxies.

show abstract

Section: Numerical Code and Parametersmentioning

confidence: 99%

Numerical Modelling of the Dynamics of the Galactic Halos in the Colliding Galaxies

Khrapov¹,

Хоперсков²,

Корчагин³

2019

Bulletin SUSU MMP

View full text Add to dashboard Cite

show abstract

“…In computational astrophysics, the problem of the software transfer to new hardware platforms becomes relevant due to the wide distribution of powerful computer systems on graphics processors. The result of the parallel software implementation and its efficiency depend a lot on the features of the code and the sequence of numerical operations [13,14].…”

Section: Introductionmentioning

confidence: 99%

New Features of Parallel Implementation of N-Body Problems on GPU

Khrapov¹,

Khoperskov²,

Хоперсков³

2018

Bulletin of the SUSU. MMP

View full text Add to dashboard Cite

This paper focuses on the parallel implementation of a direct N -body method (particleparticle algorithm) and the application of multiple GPUs for galactic dynamics simulations. Application of a hybrid OpenMP-CUDA technology is considered for models with a number of particles N ∼ 10 5 ÷ 10 7 . By means of N -body simulations of gravitationally unstable stellar galactic we have investigated the algorithms parallelization efficiency for various Nvidia Tesla graphics processors (K20, K40, K80). Particular attention was paid to the parallel performance of simulations and accuracy of the numerical solution by comparing single and double floating-point precisions (SP and DP). We showed that the doubleprecision simulations are slower by a factor of 1.7 than the single-precision runs performed on Nvidia Tesla K-Series processors. We also claim that application of the single-precision operations leads to incorrect result in the evolution of the non-axisymmetric gravitating N -body systems. In particular, it leads to significant quantitative and even qualitative distortions in the galactic disk evolution. For instance, after 10 4 integration time steps for the single-precision numbers the total energy, momentum, and angular momentum of a system with N = 2 20 conserve with accuracy of 10 −3 , 10 −2 and 10 −3 respectively, in comparison to the double-precision simulations these values are 10 −5 , 10 −15 and 10 −13 , respectively. Our estimations evidence in favour of usage of the second-order accuracy schemes with double-precision numbers since it is more efficient than in the fourth-order schemes with single-precision numbers.

show abstract