2012
DOI: 10.1016/j.jocs.2011.01.006
|View full text |Cite
|
Sign up to set email alerts
|

Implementation of a parallel tree method on a GPU

Abstract: The kd-tree is a fundamental tool in computer science. Among other applications, the application of kd-tree search (by the tree method) to the fast evaluation of particle interactions and neighbor search is highly important, since the computational complexity of these problems is reduced from O(N 2 ) for a brute force method to O(N log N) for the tree method, where N is the number of particles. In this paper, we present a parallel implementation of the tree method running on a graphics processing unit (GPU). W… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0

Year Published

2012
2012
2019
2019

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(27 citation statements)
references
References 27 publications
0
27
0
Order By: Relevance
“…The substantial component of global GPU memory usage optimization is achieving coalescing [17], which in this particular case means processing of memory-wise nearby particles by index-wise nearby work-items in the same work-group. We can achieve it by keeping local ordering of particles which seems to be a common optimization strategy for a wide set of particle-mesh algorithms [18,19].…”
Section: Implementation Detailsmentioning
confidence: 99%
“…The substantial component of global GPU memory usage optimization is achieving coalescing [17], which in this particular case means processing of memory-wise nearby particles by index-wise nearby work-items in the same work-group. We can achieve it by keeping local ordering of particles which seems to be a common optimization strategy for a wide set of particle-mesh algorithms [18,19].…”
Section: Implementation Detailsmentioning
confidence: 99%
“…Adopting the tree method is a common way to accelerate collisionless N-body simulations in astrophysics, even on GPU. Many earlier studies presented tree codes efficiently running on GPU(s), yet none had coupled their code with the block time step (Nakasato, 2012;Ogiya et al, 2013;Bédorf et al, 2012Bédorf et al, , 2014Watanabe and Nakasato, 2014). Since the block time step can also accelerate N-body simulations significantly, we have developed a gravitational octree code (GOTHIC), which is accelerated by the block time step.…”
Section: Discussionmentioning
confidence: 99%
“…The focus of this work is on massivelyparallel processing of k-d trees. While several implementations have been proposed that address such traversals from a more general perspective (e.g., in the context of ray tracing) [13,17,20,26], these approaches are not suited for nearest neighbor search in moderate-sized feature spaces (i.e., d > 3), except for the recently proposed buffer k-d tree extension [11].…”
Section: Massively-parallel Nearest Neighbor Computationsmentioning
confidence: 99%