2017
DOI: 10.1080/10618562.2017.1390084
|View full text |Cite
|
Sign up to set email alerts
|

Portable implementation model for CFD simulations. Application to hybrid CPU/GPU supercomputers

Abstract: Nowadays HPC systems experience a disruptive moment with a variety of novel architectures and frameworks, without any clarity of which one is going to prevail. In this context, the portability of codes across different architectures is of major importance. This paper presents a portable implementation model based on an algebraic operational approach for DNS and LES of incompressible turbulent flows using unstructured hybrid meshes. The strategy proposed consists in representing the whole time-integration algor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
13
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 13 publications
(15 citation statements)
references
References 27 publications
2
13
0
Order By: Relevance
“…The final speedup of the GPUs according to the CPUs of the node is of 5.44 and 3.48 times for the sphere and airplane meshes respectively. These results match other unstructured CFD codes running on GPUs [33,34]. Figure 14 (right) shows the speedup of the best GPU implementation according to the optimal CPU version at the node level.…”
Section: Gpu Performance Analysissupporting
confidence: 73%
“…The final speedup of the GPUs according to the CPUs of the node is of 5.44 and 3.48 times for the sphere and airplane meshes respectively. These results match other unstructured CFD codes running on GPUs [33,34]. Figure 14 (right) shows the speedup of the best GPU implementation according to the optimal CPU version at the node level.…”
Section: Gpu Performance Analysissupporting
confidence: 73%
“…In previous works of Oyarzun et al [18] and Álvarez et al [23], an algebra-based implementation model was proposed for the DNS and LES of incompressible turbulent flows such that the algorithm of the time-integration phase reduces to a set of only three algebraic kernels: SpMV, axpy and dot. However, a close look at Equations 17 and 18, for instance, reveals that this set is insufficient to fulfill the implementation of the flux limiter because it comprises non-linear operations.…”
Section: Algebraic Implementationmentioning
confidence: 99%
“…By casting discrete operators and mesh functions into sparse matrices and vectors, it has been shown that nearly 90% of the calculations in a typical CFD algorithm for the direct numerical simulation (DNS) and large eddy simulation (LES) of incompressible turbulent flows boil down to the following basic linear algebra subroutines: sparse matrix-vector product (SpMV), linear combination of vectors (axpy) and dot product (dot) [18]. Moreover, after the generalizations detailed in Section 3.2 this value will be raised to 100%.…”
Section: Introductionmentioning
confidence: 99%
“…As a future work, the authors are interested in extending the implementation of the framework to a hybrid MPI-openMP paradigm and study its effect on the computational efficiency. The code also has a multi-threading capability with CUDA or openCL to use GPUs as co-processors on a hybrid machine [50][51][52]. This configuration was not used in this work and we focused on CPU-only clusters.…”
Section: Single-physics Solversmentioning
confidence: 99%