A new approach to the lattice Boltzmann method for graphics processing units

Obrecht, Christian; Kuznik, Frédéric; Tourancheau, Bernard; Roux, Jean-Jacques

doi:10.1016/j.camwa.2010.01.054

Cited by 125 publications

(81 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…As shown in [12], the misalignment overhead is significantly higher for store operations than for read operations. We therefore suggested in [13] to use the in-place propagation scheme outlined by Fig. 3 instead of the ordinary out-of-place propagation scheme illustrated in Fig.…”

Section: Gpu Implementations Of the Lbmmentioning

confidence: 99%

Scalable lattice Boltzmann solvers for CUDA GPU clusters

et al. 2013

Self Cite

View full text Add to dashboard Cite

The lattice Boltzmann method (LBM) is an innovative and promising approach in computational fluid dynamics. From an algorithmic standpoint it reduces to a regular data parallel procedure and is therefore well-suited to high performance computations. Numerous works report efficient implementations of the LBM for the GPU, but very few mention multi-GPU versions and even fewer GPU cluster implementations. Yet, to be of practical interest, GPU LBM solvers need to be able to perform large scale simulations. In the present contribution, we describe an efficient LBM implementation for CUDA GPU clusters. Our solver consists of a set of MPI communication routines and a CUDA kernel specifically designed to handle three-dimensional partitioning of the computation domain. Performance measurement were carried out on a small cluster. We show that the results are satisfying, both in terms of data throughput and parallelisation efficiency.

show abstract

Section: Gpu Implementations Of the Lbmmentioning

confidence: 99%

Scalable lattice Boltzmann solvers for CUDA GPU clusters

et al. 2013

Self Cite

View full text Add to dashboard Cite

show abstract

“…Figures 4 and 5 outline the two propagation schemes (in the two-dimensional case, for the sake of simplicity). It was shown in [17] that the cost of misaligned reads is of the same order of magnitude than the overhead of a rearrange kernel. It should be noted that the in-place propagation approach is simpler and exerts less pressure on hardware than the shared memory approach.…”

Section: Gpu Implementation Of the Lbmmentioning

confidence: 99%

“…Our implementation is based on the isothermal flow solver described in [17]. The lattice is a rectangular cuboid of dimensions N x × N y × N z .…”

Section: Proposed Implementationmentioning

confidence: 99%

“…Using the least significant dimension possible to span the velocity distribution reduces the occurrences of TLB misses. We experimented a 13% performance improvement over the major dimension version which is used in [17].…”

Section: Proposed Implementationmentioning

confidence: 99%

“…This program is an extended and improved version of the isothermal solver decribed in [17]. It is part of the TheLMA project [19] which aims at providing a comprehensive framework for implementing LBM solvers on GPUs and other emerging many-core architectures.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

The TheLMA project: A thermal lattice Boltzmann solver for the GPU

et al. 2012

Self Cite

View full text Add to dashboard Cite

In this paper, we consider the implementation of a thermal flow solver based on the lattice Boltzmann method (LBM) for graphics processing units (GPU). We first describe the hybrid thermal LBM model implemented, and give a concise review of the CUDA technology. The specific issues that arise with LBM on GPUs are outlined. We propose an approach for efficient handling of the thermal part. Performance is close to optimum and is significantly better than the one of comparable CPU solvers. We validate our code by simulating the differentially heated cubic cavity (DHC). The computed results for steady flow patterns are in good agreement with previously published ones. Finally, we use our solver to study the phenomenology of transitional flows in the DHC.

show abstract

Multi‐GPU solution to the lattice Boltzmann method: An application in multiscale digital rock simulation for shale formation

Chen

Yang

Amritkar

et al. 2018

Concurrency and Computation

View full text Add to dashboard Cite

Characterization of rock properties is vital in producing oil and gas from shale reservoirs in an economically viable fashion. The nano-pore structure and ultralow permeability in shale reservoirs present challenges to the traditional experimental characterization methods. Digital rock physics for the estimation of rock properties, especially for shale reservoirs, has become a powerful tool that greatly complements to lab experiments by combining advance imaging techniques with numerical simulations. The lattice Boltzmann method (LBM) is a well-applied numerical method to simulate the fluid flow in pore structures at multiple length scales. Usually, the LBM simulation is resource intense because of its computation complexity and is facing great numerical challenges in extremely large-cale computation. In this paper, we propose a multi-GPU parallel implementation of 3D LBM on a hybrid high-performance computing cluster to perform large-scale simulations in reconstructed digital rocks. The program provides multiscale solution, pore scale and representative elementary volume (REV) scale based on the resolution of digital rock images. Optimization strategies are applied on partitioning simulation domain, improving data communication efficiency and maximizing CUDA occupancy. When running on a cluster of 32 GPUs, the proposed parallel implementation achieves a speedup of 1074x comparing to the in-house sequential program.

show abstract

A new approach to the lattice Boltzmann method for graphics processing units

Cited by 125 publications

References 15 publications

Scalable lattice Boltzmann solvers for CUDA GPU clusters

Scalable lattice Boltzmann solvers for CUDA GPU clusters

The TheLMA project: A thermal lattice Boltzmann solver for the GPU

Multi‐GPU solution to the lattice Boltzmann method: An application in multiscale digital rock simulation for shale formation

Contact Info

Product

Resources

About