Efficient Strict-Binning Particle-in-Cell Algorithm for Multi-core SIMD Processors

Barsamian, Yann; Charguéraud, Arthur; Hirstoaga, Sever Adrian; Mehrenberger, Michel

doi:10.1007/978-3-319-96983-1_53

Cited by 4 publications

(4 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Thus the multiplication of the cores without any significant increase of the memory bandwidth brings a poor speed-up. This issue is analysed in [43] and received a lot of attention for years [4,5,7]. Different workarounds are proposed to favor the locality of the data, increasing the cache reuse, therefore mitigating the number of requests to the main memory.…”

Section: Efficient Parallelization For 3d-3v Sparse Grid Particle-in-...mentioning

confidence: 99%

“…A non exhaustive overview of optimizations and parallelizations of PIC methods on shared memory architectures. In PIC simulations, the implementations are usually memory-bounded rather than compute-bounded [4,43].…”

Section: 1mentioning

confidence: 99%

“…This is thus a trade-off between the cost of re-sorting the particle population and increasing the cache-miss rate during iterations. Different elaborated data structures are proposed to alleviate the cost of the periodic particle sorting (see [4,44] and [5,6]). Another approach to mitigate the randomness of the memory accesses is to consider domain decomposition [19,42,45] with subdomains so small that they can fit in the cache system.…”

Section: 1mentioning

confidence: 99%

See 2 more Smart Citations

Efficient parallelization for 3d-3v sparse grid Particle-In-Cell: Shared memory architectures

Deluzet

Fubiani

Garrigues

et al. 2023

Journal of Computational Physics

View full text Add to dashboard Cite

Section: Efficient Parallelization For 3d-3v Sparse Grid Particle-in-...mentioning

confidence: 99%

Section: 1mentioning

confidence: 99%

Section: 1mentioning

confidence: 99%

See 1 more Smart Citation

Efficient parallelization for 3d-3v sparse grid Particle-In-Cell: Shared memory architectures

Deluzet

Fubiani

Garrigues

et al. 2023

Journal of Computational Physics

View full text Add to dashboard Cite

“…This approach improves SIMD efficiency on many-core architectures such as the Intel Xeon Phi provided that the particles arrays have enough elements. A similar approach has been extended in [16] where the authors use additional strategies such as the division of a cell's particle set into chunks to improve cache coherence and reduce memory transfers. They report acceleration when using a few hundreds particles per cell.…”

Section: Introductionmentioning

confidence: 99%

Adaptive SIMD optimizations in particle-in-cell codes with fine-grain particle sorting

Beck

Dérouillat

Lobet

et al. 2019

Computer Physics Communications

View full text Add to dashboard Cite

Particle-In-Cell (PIC) codes are broadly applied to the kinetic simulation of plasmas, from laser-matter interaction to astrophysics. Their heavy simulation cost can be mitigated by using the Single Instruction Multiple Data (SIMD) capibility, or vectorization, now available on most architectures. This article details and discusses the vectorization strategy developed in the code Smilei which takes advantage from an efficient, systematic, cell-based sorting of the particles. The PIC operators on particles (projection, push, deposition) have been optimized to benefit from large SIMD vectors on both recent and older architectures. The efficiency of these vectorized operations increases with the number of particles per cell (PPC), typically speeding up three-dimensional simulations by a factor 2 with 256 PPC. Although this implementation shows acceleration from as few as 8 PPC, it can be slower than the scalar version in domains containing fewer PPC as usually observed in vectorization attempts. This issue is overcome with an adaptive algorithm which switches locally between scalar (for few PPC) and vectorized operators (otherwise). The newly implemented methods are benchmarked on three different, large-scale simulations considering configurations frequently studied with PIC codes.

show abstract

Performance of the Particle-in-Cell Method with the Intel (Broadwell, KNL) and IBM Power9 Architectures

Berendeev

Снытников

Efimova

2019

Communications in Computer and Information Science

View full text Add to dashboard Cite

Efficient Strict-Binning Particle-in-Cell Algorithm for Multi-core SIMD Processors

Cited by 4 publications

References 22 publications

Efficient parallelization for 3d-3v sparse grid Particle-In-Cell: Shared memory architectures

Efficient parallelization for 3d-3v sparse grid Particle-In-Cell: Shared memory architectures

Adaptive SIMD optimizations in particle-in-cell codes with fine-grain particle sorting

Performance of the Particle-in-Cell Method with the Intel (Broadwell, KNL) and IBM Power9 Architectures

Contact Info

Product

Resources

About