2019
DOI: 10.1016/j.jcp.2019.04.010
|View full text |Cite
|
Sign up to set email alerts
|

A GPU accelerated discontinuous Galerkin incompressible flow solver

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
23
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(23 citation statements)
references
References 42 publications
0
23
0
Order By: Relevance
“…Comprehensive work on the topic has been undertaken by Karakus et al [53] who introduced a GPU-accelerated DG solver that uses a semi-lagragian subcycling approach and computes the pressure with a preconditioned conjugate gradient method.…”
Section: Incompressible Flowsmentioning
confidence: 99%
“…Comprehensive work on the topic has been undertaken by Karakus et al [53] who introduced a GPU-accelerated DG solver that uses a semi-lagragian subcycling approach and computes the pressure with a preconditioned conjugate gradient method.…”
Section: Incompressible Flowsmentioning
confidence: 99%
“…In the standard case of simplicial or mapped box-type elements, the use of nodal basis functions is common practice as it offers significant computational savings [20,28,26,15,25,35,32,16]. This is because nodal basis functions allow for offline precomputation of the evaluation of the basis functions on the quadrature points, which are transferred into the physical domain via elemental maps.…”
Section: Computing the Integralsmentioning
confidence: 99%
“…The benefits of GPU-acceleration in the context of discontinuous Galerkin methods have been studied extensively in the literature over the last decade or so for various classes of electromagnetic, fluid flow and other hyperbolic PDE problems; we refer to [28,26,15,25,35,32,16] for some of the most successful results in the area. The predominant application setting involves explicit time-stepping, e.g., by structure-preserving Runge-Kutta methods, combined with discontinuous Galerkin spatial discretizations with nodal representation of local finite element spaces [20].…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…By using the OCCA [19,20] library's unified API, NekRS can run on CPUs and on GPU-accelerated CPUs that support CUDA, HIP, or OpenCL. For performance portability, the code is based on the open concurrent compute abstraction and leverages scalable developments in the SEM code Nek5000 and in libParanumal [21,22], which is a library of high-performance kernels for high-order discretization and PDE-based mini-apps. Critical performance results on several platforms indicates the strong scaling of NekRS including scaling to 27,648 V100s on OLCF Summit, for calculations of up to 60B grid points [23].…”
Section: Introduction On Nekrsmentioning
confidence: 99%