2020
DOI: 10.1145/3424144
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Code Generation for High-performance Discontinuous Galerkin Methods on Modern Architectures

Abstract: SIMD vectorization has lately become a key challenge in high-performance computing. However, hand-written explicitly vectorized code often poses a threat to the software’s sustainability. In this publication, we solve this sustainability and performance portability issue by enriching the simulation framework dune-pdelab with a code generation approach. The approach is based on the well-known domain-specific language UFL but combines it with loopy, a more powerful intermediate representation for the computation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 48 publications
0
7
0
Order By: Relevance
“…For tensor-product shape functions that are integrated on a tensor-product quadrature formula, sum factorization allows to decompose these two steps into a series of one-dimensional interpolations of total cost O(pd+1) per element in d dimensions (or O(p) per unknown), compared to the naive evaluation cost of O(p2d). The sum-factorization approach has been developed in the context of the spectral element method by Orszag (1980), Patera (1984), and Tufo and Fischer (1999), see also the book by Deville et al (2002) as well as recent implementation and vectorization studies by Kronbichler and Kormann (2012, 2019), Świrydowicz et al (2019), Fischer et al (2020), Sun et al (2020), Moxey et al (2020), and Kempf et al (2021).…”
Section: Fast Matrix-free Operator Evaluationmentioning
confidence: 99%
“…For tensor-product shape functions that are integrated on a tensor-product quadrature formula, sum factorization allows to decompose these two steps into a series of one-dimensional interpolations of total cost O(pd+1) per element in d dimensions (or O(p) per unknown), compared to the naive evaluation cost of O(p2d). The sum-factorization approach has been developed in the context of the spectral element method by Orszag (1980), Patera (1984), and Tufo and Fischer (1999), see also the book by Deville et al (2002) as well as recent implementation and vectorization studies by Kronbichler and Kormann (2012, 2019), Świrydowicz et al (2019), Fischer et al (2020), Sun et al (2020), Moxey et al (2020), and Kempf et al (2021).…”
Section: Fast Matrix-free Operator Evaluationmentioning
confidence: 99%
“…The sum-factorization approach has been developed in the context of the spectral element method by Orszag (1980), Patera (1984), and Tufo and Fischer (1999), see also the book by Deville et al (2002) as well as recent implementation and vectorization studies by Kormann (2012, 2019), Świrydowicz et al (2019Świrydowicz et al ( ), Fischer et al (2020, Sun et al (2020), Moxey et al (2020), and Kempf et al (2021).…”
Section: It Involves the Vector-valued Poisson Equation In Amentioning
confidence: 99%
“…Our highly-tuned kernels use Flop-minimizing optimizations in the sum factorization algorithms, such as change of basis and even-odd decomposition [43]. These optimizations yield a speedup of 1.5×-2× compared to previous results such as [38]. Cell and face integrals are computed alternately, in order to limit the access in the source vector P to a single access from main memory and serve the other accesses primarily from caches.…”
Section: Innovations Realized 31 Matrix-free Operator Evaluationmentioning
confidence: 99%