2009
DOI: 10.1007/s00450-009-0089-1
|View full text |Cite
|
Sign up to set email alerts
|

A novel multiple-walk parallel algorithm for the Barnes–Hut treecode on GPUs – towards cost effective, high performance N-body simulation

Abstract: Recently, general-purpose computation on graphics processing units (GPGPU) has become an increasingly popular field of study as graphics processing units (GPUs) continue to be proposed as high performance and relatively low cost implementation platforms for scientific computing applications. Among these applications figure astrophysical N-body simulations, which form one of the most challenging problems in computational science. However, in most reported studies, a simple O(N 2 ) algorithm was used for GPGPUs,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
18
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 27 publications
(19 citation statements)
references
References 15 publications
1
18
0
Order By: Relevance
“…Therefore a method was developed to create lists of interacting particles on the host and then let the GRAPE solve the gravitational interactions [12,13]. Recently this method has successfully been applied to GPUs [14,15,16]. With the GPU being able to efficiently calculate the force interactions, other parts like the tree-construction and tree-traverse become the bottleneck of the application.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore a method was developed to create lists of interacting particles on the host and then let the GRAPE solve the gravitational interactions [12,13]. Recently this method has successfully been applied to GPUs [14,15,16]. With the GPU being able to efficiently calculate the force interactions, other parts like the tree-construction and tree-traverse become the bottleneck of the application.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, nowadays the developments of GPU applications have moved beyond the single GPU stage, and both performance and parallel efficiency of the applications must be optimized. In astrophysical simulations, considerable performance speed-ups in multi-GPU systems have been demonstrated in a broad range of applications, for example, the direct N -body simulations (e.g., Schive et al 2008;Gaburov et al 2009;Spurzem et al 2011), Barnes-Hut tree algorithm (Hamada et al 2009), and reionization simulations (Aubert & Teyssier 2010). grid patches with an oct-tree data structure, and the relaxing total variation diminishing scheme (RTVD; Jin & Xin 1995) with directional splitting is adopted in the hydrodynamic solver.…”
Section: Introductionmentioning
confidence: 99%
“…During the past decade, the clock rates of Central Processing Units have been practically stagnant, while GPUs provide a proven alternative for high-performance computing (Barsdell et al 2010), particularly for N-body codes (Hamada et al 2009). The models are run at the Center for Astrophysics and Supercomputing of Swinburne University.…”
Section: The Modelsmentioning
confidence: 99%