2015
DOI: 10.1145/2739047
|View full text |Cite
|
Sign up to set email alerts
|

An Optimizing Code Generator for a Class of Lattice-Boltzmann Computations

Abstract: The Lattice-Boltzmann method (LBM), a promising new particle-based simulation technique for complex and multiscale fluid flows, has seen tremendous adoption in recent years in computational fluid dynamics. Even with a state-of-the-art LBM solver such as Palabos, a user has to still manually write the program using library-supplied primitives. We propose an automated code generator for a class of LBM computations with the objective to achieve high performance on modern architectures. Few studies have looked at … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2015
2015
2025
2025

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 29 publications
0
10
0
Order By: Relevance
“…There has been a great amount of work [11,13,22,25,33,34] reported on the evaluation of diamond tiling. It was also generalized to handle iterated stencils defined over periodic data domains with index set splitting [5] and the Lattice-Boltzmann method [26]. Unlike overlapped tiling and split tiling, diamond tiling may work with arbitrary affine dependences.…”
Section: Related Workmentioning
confidence: 99%
“…There has been a great amount of work [11,13,22,25,33,34] reported on the evaluation of diamond tiling. It was also generalized to handle iterated stencils defined over periodic data domains with index set splitting [5] and the Lattice-Boltzmann method [26]. Unlike overlapped tiling and split tiling, diamond tiling may work with arbitrary affine dependences.…”
Section: Related Workmentioning
confidence: 99%
“…The stencil benchmarks were optimized for cache locality using the Pluto heuristic [Pluto 2008]. The unsharp-mask and harris-corner kernels were taken from PolyMage [Mullapudi et al 2015] while the LBM benchmarks are due to the work of [Pananilath et al 2015]. Note that in all cases where we perform tiling, tile sizes Table I.…”
Section: Implementation and Practical Impactmentioning
confidence: 99%
“…Their results show excellent weak and strong scalability with 8192 CPU cores. Pananilath I. et al developed an automated code generator for LBM simulations, featuring optimization techniques of tiling, load balancing, SIMD, etc. to boost LBM codes' performance.…”
Section: Related Workmentioning
confidence: 99%