Memory-bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice Boltzmann method (LBM) on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single-chip performance and power characteristics can be generalized to the highly parallel case. First, we perform an analysis of a sparse-lattice LBM implementation for complex geometries. Using a single-core performance model, we predict the intra-chip saturation characteristics and the optimal operating point in terms of energy-to-solution as a function of implementation details, clock frequency, vectorization, and number of active cores per chip. We show that high single-core performance and a correct choice of the number of active cores per chip are the essential optimizations for achieving the lowest energy-to-solution at minimal performance degradation. Then we extrapolate to the Message Passing Interface (MPI)-parallel level and quantify the energy-saving potential of various optimizations and execution modes; there, these guidelines turn out to be even more important, especially when communication overhead is non-negligible. In our setup, we achieve energy savings of 35% in this case compared with a naive approach. We also demonstrate that a simple, non-reflective reduction of the clock speed leaves most of the energy-saving potential unused.

[1-10]. Here, we conduct a thorough analysis of performance and energy-to-solution at the chip level and the highly parallel level for an MPI-parallel implementation of the LBM. We start from observations of the intra-chip saturation characteristics of two different implementations, which differ in the order in which the flow data in the lattice sites are updated ('propagation methods' [10]). Then we apply the execution-cache-memory (ECM) performance model and a simple multicore power model to describe the optimal operating point in terms of performance and energy-to-solution as a function of the clock frequency and the single instruction multiple data (SIMD) vectorization (a schematic numerical sketch of such a model follows below). To find out whether the knowledge thus gained at the chip level can be generalized to the highly parallel case, we conduct scaling experiments on a modern cluster system up to a point where MPI communication overhead becomes significant.

This paper is organized as follows. The remainder of Section 1 covers related work, the basics of the lattice Boltzmann implementations, the hardware used for testing, and a list of contributions. Section 2 then introduces, applies, and validates the ECM model on the Intel Sandy Bridge architecture. In Section 3, we use a recently introduced multicore power model to identify the optimal operating points on the chip. Section 4 presents performance data for highly parallel runs and analyzes the impact of the different parameters (clock speed, number of cores per chip, SIMD vectorization, and system baseline power). Finally, Section 5 gives a summary and an outlook on future research.
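As a rough illustration of the kind of chip-level reasoning outlined above, the following Python sketch combines a saturating performance estimate (linear core scaling up to a memory-bandwidth ceiling, in the spirit of roofline/ECM-type arguments) with a baseline-plus-dynamic power model of the form W0 + n(W1 f + W2 f^2), and scans active cores and clock frequency for the energy-optimal operating point. This is not the paper's code or model calibration: the function name energy_to_solution and all numerical coefficients are illustrative assumptions only.

```python
# Minimal sketch (illustrative only, not the code or coefficients used in this
# paper): energy-to-solution from a saturating performance model combined with
# a baseline-plus-dynamic chip power model.

def energy_to_solution(n_cores, f_ghz,
                       w0=25.0,      # assumed baseline power [W]
                       w1=2.0,       # assumed per-core dynamic power, linear in f [W/GHz]
                       w2=1.0,       # assumed per-core dynamic power, quadratic in f [W/GHz^2]
                       p_core=120.0, # assumed single-core LBM performance at 1 GHz [MLUP/s]
                       p_sat=650.0): # assumed bandwidth-limited saturation performance [MLUP/s]
    """Return estimated energy per million lattice-site updates [J/MLUP]."""
    perf = min(n_cores * p_core * f_ghz, p_sat)             # MLUP/s, saturates at the memory limit
    power = w0 + n_cores * (w1 * f_ghz + w2 * f_ghz ** 2)   # W
    return power / perf                                      # (J/s) / (MLUP/s) = J/MLUP

# Scan active cores and clock frequencies for the energy-optimal operating point.
best = min(((energy_to_solution(n, f), n, f)
            for n in range(1, 9)                  # up to 8 cores per Sandy Bridge chip
            for f in (1.2, 1.6, 2.0, 2.4, 2.7)),  # illustrative frequency steps [GHz]
           key=lambda t: t[0])
print("lowest energy-to-solution: %.3f J/MLUP at %d cores, %.1f GHz" % best)
```

With this toy parameterization, the minimum is found at the smallest core count and clock frequency that just saturate the memory bandwidth, which is the qualitative behavior analyzed rigorously, with measured model parameters, in Sections 2 and 3.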
Related work
The roofline model of Williams et al. [11] pr...