High-order matrix-free incompressible flow solvers with GPU acceleration and low-order refined preconditioners

Franco, Michael; Camier, Jean-Sylvain; Andrej, Julian; Pazner, Will

doi:10.1016/j.compfluid.2020.104541

Cited by 16 publications

(12 citation statements)

References 60 publications


“…As a consequence, the matrix-free algorithm has significantly higher arithmetic intensity than the matrix-based algorithm. On GPU-based platforms, memory transfer is typically the bottleneck, and the matrix-free algorithms can be expected to outperform algorithms requiring fully assembled matrices [23,14,13]. The appropriate choice of algorithm will depend on both polynomial degrees p and q.…”

Section: Implementation and Numerical Results (mentioning)

confidence: 99%
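The arithmetic-intensity claim in the statement above can be illustrated with a rough per-element cost model. This is only a sketch under simplifying assumptions that are not taken from the cited papers: a single polynomial degree p (the quoted text considers two degrees, p and q), (p+1)^d quadrature points per element, 8-byte values, and no cache reuse. The function names are hypothetical.

```python
def dense_intensity(p, d, bytes_per_value=8):
    """FLOP/byte estimate for applying an assembled dense element matrix."""
    n = (p + 1) ** d                      # degrees of freedom per element
    flops = 2 * n * n                     # one multiply and one add per matrix entry
    data = bytes_per_value * (n * n + 2 * n)  # matrix entries plus input and output vectors
    return flops / data


def matrix_free_intensity(p, d, bytes_per_value=8):
    """FLOP/byte estimate for a sum-factorized (matrix-free) element apply."""
    n1 = p + 1
    n = n1 ** d
    # Forward and backward passes: d one-dimensional contractions each,
    # every contraction costing about 2 * n1 flops per degree of freedom,
    # plus one pointwise scaling at the quadrature points.
    flops = 2 * d * (2 * n1 * n) + n
    data = bytes_per_value * 3 * n        # input vector, output vector, quadrature data
    return flops / data


if __name__ == "__main__":
    for p in (1, 2, 4, 8):
        print(p, dense_intensity(p, 3), matrix_free_intensity(p, 3))
```

In this model the assembled variant stays below 0.25 FLOP/byte for every degree, while the matrix-free estimate grows linearly with p, which is consistent with the qualitative claim in the quoted statement.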

Conservative and accurate solution transfer between high-order and low-order refined finite element spaces

Kolev, Pazner

2021

Preprint

Self Cite


In this paper we introduce general transfer operators between high-order and low-order refined finite element spaces that can be used to couple high-order and low-order simulations. Under natural restrictions on the low-order refined space we prove that both the high-to-low-order and low-to-high-order linear mappings are conservative, constant preserving and high-order accurate. While the proofs apply to affine geometries, numerical experiments indicate that the results hold for more general curved and mixed meshes. These operators also have applications in the context of coarsening solution fields defined on meshes with nonconforming refinement. The transfer operators for H^1 finite element spaces require a globally coupled solve, for which robust and efficient preconditioners are developed. We present several numerical results confirming our analysis and demonstrate the utility of the new mappings in the context of adaptive mesh refinement and conservative multi-discretization coupling.


“…Our default choice is the Minimum Residual (MINRES) method, as ∂²F is symmetric but not necessarily positive-definite. Preconditioning for matrix-free inversion is a substantial challenge and an active area of research [5]. We have the option to perform Jacobi preconditioning, as the diagonal of ∂²F can be computed through tensor contractions without having the global matrix; these algorithms can be found in files fem/tmop/tmop_pa_h2d.cpp and fem/tmop/tmop_pa_h3d.cpp for 2D and 3D, respectively.…”

Section: Second Derivative and Linear Solver (mentioning)

confidence: 99%
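The idea of obtaining a diagonal "without having the global matrix" can be sketched on a toy operator. The example below is not the TMOP Hessian and does not reproduce the MFEM kernels named in the statement; it assumes a 2D Kronecker-sum operator A kron I + I kron A built from a small 1D matrix A, and all function names are hypothetical.

```python
def apply_kron_sum(A, x, n):
    """Matrix-free y = (A kron I + I kron A) x, with x stored row-major on an n-by-n grid."""
    y = [0.0] * (n * n)
    for i in range(n):
        for j in range(n):
            s = 0.0
            for k in range(n):
                s += A[i][k] * x[k * n + j]   # contraction along the first index
                s += A[j][k] * x[i * n + k]   # contraction along the second index
            y[i * n + j] = s
    return y


def kron_sum_diagonal(A, n):
    """Diagonal of the same operator, built only from the diagonal of the 1D matrix."""
    return [A[i][i] + A[j][j] for i in range(n) for j in range(n)]


def jacobi_apply(diag, r):
    """Jacobi preconditioner: divide each residual entry by the matching diagonal entry."""
    return [ri / di for ri, di in zip(r, diag)]


if __name__ == "__main__":
    A = [[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]]
    diag = kron_sum_diagonal(A, 3)
    residual = apply_kron_sum(A, [1.0] * 9, 3)
    print(jacobi_apply(diag, residual))
```

The diagonal costs O(n^2) work here, whereas extracting it by applying the operator to every unit vector would need n^2 operator applications. A Krylov method such as MINRES would then call `jacobi_apply` once per iteration.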

“…Obtaining the above PA complexities, however, requires that the finite element basis functions are tensor products of 1D basis functions, e.g., quadrilaterals in 2D and hexahedra in 3D. Partial assembly has become even more relevant in recent years [4,5,6] owing to its efficient use of GPU-based architectures, which are desirable for arithmetically intensive applications that do not require a large amount of data to be moved between the CPU and GPU [1].…”

Section: Introduction (mentioning)

confidence: 99%
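Why the tensor-product requirement in the statement above matters can be seen in a small sum-factorization sketch. It assumes a 2D element whose basis is the product of a 1D basis tabulated in a q-by-n matrix B (q evaluation points, n 1D basis functions); the names and the tiny linear basis in the usage lines are illustrative assumptions, not code from the cited work.

```python
def interp_dense(B, u, n, q):
    """Evaluate (B kron B) u entry by entry: O(q^2 n^2) operations."""
    out = [0.0] * (q * q)
    for a in range(q):
        for b in range(q):
            s = 0.0
            for i in range(n):
                for j in range(n):
                    s += B[a][i] * B[b][j] * u[i * n + j]
            out[a * q + b] = s
    return out


def interp_sum_factorized(B, u, n, q):
    """Same result, contracting one index at a time: O(q n^2 + q^2 n) operations."""
    # First contraction over the second index j of u
    tmp = [[sum(B[b][j] * u[i * n + j] for j in range(n)) for b in range(q)]
           for i in range(n)]
    # Second contraction over the first index i
    return [sum(B[a][i] * tmp[i][b] for i in range(n))
            for a in range(q) for b in range(q)]


if __name__ == "__main__":
    # 1D linear basis evaluated at x = 0, 0.5, 1 (illustrative choice)
    B = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
    u = [1.0, 2.0, 3.0, 4.0]
    print(interp_dense(B, u, 2, 3))
    print(interp_sum_factorized(B, u, 2, 3))
```

Both functions return the same values; only the operation count differs, and the gap widens in 3D and at higher degree. Without a tensor-product basis the factorization in the second function is not available.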

Accelerating High-Order Mesh Optimization Using Finite Element Partial Assembly on GPUs

Camier, Dobrev, Knupp

et al. 2022

Preprint

Self Cite


In this paper we present a new GPU-oriented mesh optimization method based on high-order finite elements. Our approach relies on node movement with fixed topology, through the Target-Matrix Optimization Paradigm (TMOP) and uses a global nonlinear solve over the whole computational mesh, i.e., all mesh nodes are moved together. A key property of the method is that the mesh optimization process is recast in terms of finite element operations, which allows us to utilize recent advances in the field of GPU-accelerated high-order finite element algorithms. For example, we reduce data motion by using tensor factorization and matrix-free methods, which have superior performance characteristics compared to traditional full finite element matrix assembly and offer advantages for GPU-based HPC hardware. We describe the major mathematical components of the method along with their efficient GPU-oriented implementation. In addition, we propose an easily reproducible mesh optimization test that can serve as a performance benchmark for the mesh optimization community.


“…MARBL is built on modular physics and computer science components and makes extensive use of high-order finite element numerical methods. Compared to standard low-order finite volume schemes, high-order numerical methods have more resolution/accuracy per unknown and have higher FLOP/byte ratios, meaning that more floating-point operations are performed for each piece of data retrieved from memory (Dobrev et al., 2012; Franco et al., 2020). This leads to improved strong parallel scalability, better throughput on GPU platforms, and increased computational efficiency.…”

Section: Introduction (mentioning)

confidence: 99%

Matrix-free approaches for GPU acceleration of a high-order finite element hydrodynamics application using MFEM, Umpire, and RAJA

Vargas, Stitt, Weiss

et al. 2022

The International Journal of High Performance Computing Applica

Self Cite


With the introduction of advanced heterogeneous computing architectures based on GPU accelerators, large-scale production codes have had to rethink their numerical algorithms and incorporate new programming models and memory management strategies in order to run efficiently on the latest supercomputers. In this work we discuss our co-design strategy to address these challenges and achieve performance and portability with MARBL, a next-generation multi-physics code in development at Lawrence Livermore National Laboratory. We present a two-fold approach, wherein new hardware is used to motivate both new algorithms and new abstraction layers, resulting in a single source application code suitable for a variety of platforms. Focusing on MARBL’s ALE hydrodynamics package, we demonstrate scalability on different platforms and highlight that many of our innovations have been contributed back to open-source software libraries, such as MFEM (finite element algorithms) and RAJA (kernel abstractions).
