2020
DOI: 10.1016/j.compfluid.2020.104541
|View full text |Cite
|
Sign up to set email alerts
|

High-order matrix-free incompressible flow solvers with GPU acceleration and low-order refined preconditioners

Abstract: We present a matrix-free flow solver for high-order finite element discretizations of the incompressible Navier-Stokes and Stokes equations with GPU acceleration. For high polynomial degrees, assembling the matrix for the linear systems resulting from the finite element discretization can be prohibitively expensive, both in terms of computational complexity and memory. For this reason, it is necessary to develop matrix-free operators and preconditioners, which can be used to efficiently solve these linear syst… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 16 publications
(12 citation statements)
references
References 60 publications
0
12
0
Order By: Relevance
“…As a consequence, the matrix-free algorithm has significantly higher arithmetic intensity than the matrix-based algorithm. On GPU-based platforms, memory transfer is typically the bottleneck, and the matrix-free algorithms can be expected to outperform algorithms requiring fully assembled matrices [23,14,13]. The appropriate choice of algorithm will depend on both polynomial degrees p and q.…”
Section: Implementation and Numerical Resultsmentioning
confidence: 99%
“…As a consequence, the matrix-free algorithm has significantly higher arithmetic intensity than the matrix-based algorithm. On GPU-based platforms, memory transfer is typically the bottleneck, and the matrix-free algorithms can be expected to outperform algorithms requiring fully assembled matrices [23,14,13]. The appropriate choice of algorithm will depend on both polynomial degrees p and q.…”
Section: Implementation and Numerical Resultsmentioning
confidence: 99%
“…Our default choice is the Minimum Residual (MINRES) method, as ∂ 2 F is symmetric but not necessarily positivedefinite. Preconditioning for matrix-free inversion is a substantial challenge and an active area of research [5]. We have the option to perform Jacobi preconditioning, as the diagonal of ∂ 2 F can be computed through tensor contractions without having the global matrix; these algorithms can be foung in files fem/tmop/tmop pa h2d.cpp and fem/tmop/tmop pa h3d.cpp for 2D and 3D, respectively.…”
Section: Second Derivative and Linear Solvermentioning
confidence: 99%
“…Obtaining the above PA complexities, however, requires that the finite element basis functions are tensor products of 1D basis functions, e.g., quadrilaterals in 2D and hexahedra in 3D. Partial assembly has become even more relevant in recent years [4,5,6] owing to its efficient use of GPU-based architectures, which are desirable for arithmetically intensive applications that do not require a large amount of data to be moved between the CPU and GPU [1].…”
Section: Introductionmentioning
confidence: 99%
“…MARBL is built on modular physics and computer science components and makes extensive use of high-order finite element numerical methods. Compared to standard low-order finite volume schemes, high-order numerical methods have more resolution/accuracy per unknown and have higher FLOP/ byte ratios meaning that more floating-point operations are performed for each piece of data retrieved from memory (Dobrev et al, 2012); (Franco et al, 2020). This leads to improved strong parallel scalability, better throughput on GPU platforms, and increased computational efficiency.…”
Section: Introductionmentioning
confidence: 99%