Scalable Distributed Fast Multipole Methods

Hu, Qi; Gumerov, Nail A.; Duraiswami, Ramani

doi:10.1109/hpcc.2012.44

Cited by 5 publications

(3 citation statements)

References 18 publications


“…We develop new data structures for the distributed algorithm which separate the computation and communication to avoid synchronization during GPU computations. The new data structures [7] build on the local essential tree (LET) [8], [9] concept but use a master-slave model and further have a novel parallel construction algorithm, in which the granularity is at the level of the spatial boxes (which allows finer parallelization than at the single-node level). Basically, each node divides its assigned domain into small spatial boxes via octrees and classifies each box into one of five categories in parallel.…”

Section: Major Contributions (mentioning)

confidence: 99%
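The statement above says each node divides its assigned domain into small spatial boxes via octrees. A minimal sketch of that box assignment follows, assuming a cubic domain and Morton (Z-order) keys. The names `box_index`, `morton_key`, and `group_by_box` are hypothetical and are not taken from the cited papers. The five box categories are not listed in the quoted text, so they are not reproduced here.

```python
def box_index(point, level, lo=0.0, hi=1.0):
    """Integer (ix, iy, iz) of the octree box containing `point` at `level`.

    Assumes a cubic domain [lo, hi]^3; points on the upper face are
    clamped into the last box.
    """
    n = 1 << level  # number of boxes per dimension at this level
    idx = []
    for c in point:
        i = int((c - lo) / (hi - lo) * n)
        idx.append(min(max(i, 0), n - 1))
    return tuple(idx)


def morton_key(ix, iy, iz, level):
    """Interleave the bits of (ix, iy, iz) into a single Z-order key."""
    key = 0
    for b in range(level):
        key |= ((ix >> b) & 1) << (3 * b)
        key |= ((iy >> b) & 1) << (3 * b + 1)
        key |= ((iz >> b) & 1) << (3 * b + 2)
    return key


def group_by_box(points, level):
    """Map each occupied box key to the list of points inside it."""
    boxes = {}
    for p in points:
        key = morton_key(*box_index(p, level), level)
        boxes.setdefault(key, []).append(p)
    return boxes
```

Each point is processed independently of the others, which is in the spirit of the box-level granularity the statement describes. A distributed version would also have to decide which boxes are exchanged between nodes, and this sketch does not attempt that.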

Abstract: Scalable Fast Multipole Methods for Vortex Element Methods

Gumerov, Yokota, et al. 2012

2012 SC Companion: High Performance Computing, Networking Storage and Analysis


We use a particle-based method to simulate incompressible flows, where the Fast Multipole Method (FMM) is used to accelerate the calculation of particle interactions. The most time-consuming kernels, the Biot-Savart equation and the stretching term of the vorticity equation, are mathematically reformulated so that only two Laplace scalar potentials are used instead of six, while automatically ensuring divergence-free far-field computation. Based on this formulation, and on our previous work for a scalar heterogeneous FMM algorithm, we develop a new FMM-based vortex method capable of simulating general flows including turbulence on heterogeneous architectures. Our work for this poster focuses on the computation perspective, and our implementation can perform one time step of the velocity+stretching for one billion particles on 32 nodes in 55.9 seconds, which yields 49.12 Tflop/s.
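The performance figures in this abstract can be turned into per-node and per-particle rates with simple arithmetic. The sketch below uses only the four numbers quoted above. The function name `derived_rates` and the dictionary keys are hypothetical, and the derived values are back-of-the-envelope estimates rather than figures reported by the authors.

```python
def derived_rates(particles, nodes, seconds, tflops):
    """Derive simple throughput figures from the reported totals."""
    total_flop = tflops * 1e12 * seconds  # operations in one time step
    return {
        "tflops_per_node": tflops / nodes,
        "particles_per_second": particles / seconds,
        "particles_per_node": particles / nodes,
        "flop_per_particle": total_flop / particles,
    }


# Numbers quoted in the abstract: one billion particles, 32 nodes,
# 55.9 seconds per time step, 49.12 Tflop/s aggregate.
rates = derived_rates(1_000_000_000, 32, 55.9, 49.12)
for name, value in rates.items():
    print(f"{name}: {value:.4g}")
```

Under these assumptions each node sustains roughly 1.5 Tflop/s and handles about 31 million particles, at around 2.7 million floating-point operations per particle per time step.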


“…Therefore, we sought to extend our software to run on multiple nodes. There has been a lot of work done on parallelizing the FMM across many nodes (Hu et al, 2011, 2012; Lashuk et al, 2012), especially with regard to maintaining good strong and weak scaling. In many cases, multi-node FMM algorithms have been applied to the BEM (Dang et al, 2016; Malhotra and Biros, 2016; Michiels et al, 2015; Yokota et al, 2011).…”

Section: Introduction (mentioning)

confidence: 99%

Highly parallel boundary element method for solving extremely large, wide-area power-line models

Adelman 2020

The International Journal of High Performance Computing Applica


The electric and magnetic fields around power lines carry an immense amount of information about the power grid and can be used to improve stability, balance loads, conserve power, and reduce outages. To study this, an extremely large model of transmission lines over a 70-km² tract of land near Washington, DC, has been built. The terrain was modeled accurately using 1-m-resolution LIDAR data. The 140-million-element power-line model was solved using the boundary element method, and the solvers were parallelized across DEVCOM Army Research Laboratory’s Centennial supercomputer using a modified version of the domain decomposition method. The code on each node was accelerated using the fast multipole method and, when available, GPUs. Additionally, larger test models were used to characterize the scalability of the code. The largest test model had 10,010,944,000 elements, and was solved on 1,024 nodes in 4.3 hours.


“…The FMM can be efficiently parallelized [37]. The first implementation of the FMM on graphics processors [32] was developed further [38,39], where the FMM was implemented on heterogeneous computing architectures consisting of multicore CPUs and GPUs. This FMM parallelization strategy for heterogeneous architectures was successfully used in fluid and molecular dynamics [40,41,42,43,44] and in electro- and magnetostatics [45].…”

Section: Introduction (mentioning)

confidence: 99%

GPU accelerated fast multipole boundary element method for simulation of 3D bubble dynamics in potential flow

Gumerov, Pityuk, Abramova, et al. 2019

Preprint

Self Cite


A numerical method for simulation of bubble dynamics in three-dimensional potential flows is presented. The approach is based on the boundary element method for the Laplace equation accelerated via the fast multipole method implemented on a heterogeneous CPU/GPU architecture. For mesh stabilization, a new smoothing technique using a surface filter is presented. This technique relies on spherical harmonics expansion of surface functions for bubbles topologically equivalent to a sphere (or Fourier series for toroidal bubbles). The method is validated by comparisons with solutions available in the literature and convergence studies for bubbles in acoustic fields. The accuracy and performance of the algorithm are discussed. It is demonstrated that the approach enables simulation of dynamics of bubble clusters with thousands of bubbles and millions of boundary elements on contem-
