2015
DOI: 10.1002/cpe.3489
|View full text |Cite
|
Sign up to set email alerts
|

Chip‐level and multi‐node analysis of energy‐optimized lattice Boltzmann CFD simulations

Abstract: Memory-bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice Boltzmann method on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single-chip performance and power characteristics can be generalized to the highly parallel case. First, we perform an analysis of a sparse-lattice lattice Boltzmann method implementation for complex geometries. Using a single-core performance model, we predict the intra-chip saturation… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 30 publications
(24 citation statements)
references
References 26 publications
0
24
0
Order By: Relevance
“…however, we think that correlations between several parameters better highlight the underlying mechanisms; other authors share this attitude (see, for instance, Calore et al 22 ). ‡ The NVIDIA K80 memory clock frequency can also be set to 324 MHz, but such a low frequency is designed to be useful only to reduce power drain when the processor idles.…”
Section: Performance and Energy Modelsmentioning
confidence: 80%
See 1 more Smart Citation
“…however, we think that correlations between several parameters better highlight the underlying mechanisms; other authors share this attitude (see, for instance, Calore et al 22 ). ‡ The NVIDIA K80 memory clock frequency can also be set to 324 MHz, but such a low frequency is designed to be useful only to reduce power drain when the processor idles.…”
Section: Performance and Energy Modelsmentioning
confidence: 80%
“…Other quantities -e.g. the energy-delay product (EDP) -have been proposed in the literature, in an attempt to define a single figure-of-merit; however we think that correlations between several parameters better highlight the underlying mechanisms; other authors share this attitude (see for instance [22]). …”
Section: Performance and Energy Modelsmentioning
confidence: 99%
“…See, for example, [10] for the general methodology and [14,23,24] for a case studies with various algorithms. Beyond these immediate results, we have demonstrated the application of a model-guided and bottleneck-focused performance engineering effort on the example of the Kahan summation algorithm.…”
Section: Resultsmentioning
confidence: 99%
“…We emphasize that the approach and insights described here for the special case of the Kahan scalar product can serve as a blueprint for other streaming kernels. See, for example, [10] for the general methodology and [14,23,24] for a case studies with various algorithms.…”
Section: Resultsmentioning
confidence: 99%
“…Therefore, optimizing them on recent platforms and for different application cases has been searched intensively in the last 10 years. In their article Chip-level and Multi-node Analysis of Energy-optimized Lattice-Boltzmann CFD Simulation, Wittmann et al [24] analyze the behavior of D3Q19 lattice-Boltzmann solvers on modern HPC systems. They first present chip-level models for both performance and energy consumption.…”
Section: Pattern Recognitionsmentioning
confidence: 99%