2015
DOI: 10.1007/s40571-015-0059-2
|View full text |Cite
|
Sign up to set email alerts
|

Performance improvements of differential operators code for MPS method on GPU

Abstract: In the present study, performance improvements of the particle search and particle interaction calculation steps constituting the performance bottleneck in the moving particle simulation method are achieved by developing GPU-compatible algorithms for many core processor architectures. In the improvements of particle search, bucket loops of the cell-linked list are changed to a loop structure having fewer local variables and the linked list and the forward star of particle search algorithms within a bucket are … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
6
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(6 citation statements)
references
References 41 publications
0
6
0
Order By: Relevance
“…To ensure computational stability and efficiency, it is recommended that the influence radius be 2-4 times the initial particle distance [16]. For particle i, the particle number density can be defined as…”
Section: Explicit Mps Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…To ensure computational stability and efficiency, it is recommended that the influence radius be 2-4 times the initial particle distance [16]. For particle i, the particle number density can be defined as…”
Section: Explicit Mps Methodsmentioning
confidence: 99%
“…The main limitation of MPS is that it can be computationally expensive and requires a large number of particles to accurately capture the flow behavior. To overcome this limitation, the parallel computing technique can be used to improve its efficiency and make it feasible for practical applications [16][17][18]. As a massively parallel processor, the GPU can achieve high accelerations and efficiency using CUDA or OpenCL programming frameworks [19,20].…”
Section: Introductionmentioning
confidence: 99%
“…Schematic diagram of the CD‐WC‐MPS coupled with geometrically nonlinear shell numerical algorithm (The reader interested on the bucket‐based domain decomposition is referred to Reference 92) [Colour figure can be viewed at wileyonlinelibrary.com]…”
Section: Methodsmentioning
confidence: 99%
“…During recent years, increased focus has shifted to improving the computational efficiency of particle methods through a multigrid scheme (Södersten et al, 2019), parallelization (Guo et al, 2018), and running particle methods on graphics processing units (GPUs) (Murotani et al, 2015;Chow et al, 2018). Moreover, several multi-resolution (Tang Axel SÖDERSTEN*, Takuya MATSUNAGA*, Seiichi KOSHIZUKA*, Tomoyuki HOSAKA** and Eiji ISHII* et al, 2016a;Tang et al, 2016b;Chen et al, 2016;Tanaka et al, 2018;Khayyer et al, 2019;Wang et al, 2019) schemes have been proposed, where high spatial resolution is only considered over critical regions of the flow, such as around clearances (Tanaka et al, 2018) and at impact with walls (Chen et al, 2016;Tanaka et al, 2018), rigid structures (Wang et al, 2019) and elastic beams (Khayyer et al, 2019;Khayyer et al, 2021).…”
Section: Introductionmentioning
confidence: 99%