2009
DOI: 10.1016/j.jcp.2009.06.041
|View full text |Cite
|
Sign up to set email alerts
|

Nodal discontinuous Galerkin methods on graphics processors

Abstract: a b s t r a c tDiscontinuous Galerkin (DG) methods for the numerical solution of partial differential equations have enjoyed considerable success because they are both flexible and robust: They allow arbitrary unstructured geometries and easy control of accuracy without compromising simulation stability. Lately, another property of DG has been growing in importance: The majority of a DG operator is applied in an element-local way, with weak penalty-based element-to-element coupling.The resulting locality in me… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
253
0
3

Year Published

2011
2011
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 283 publications
(256 citation statements)
references
References 16 publications
0
253
0
3
Order By: Relevance
“…This is related to the fact that the algorithm is memory bandwidth limited rather than compute limited. It is well known that the use of linear finite elements with explicit time steps results in relatively few calculations being performed for a given amount of data being loaded from the memory; this is why it has been proposed to use higher order elements on GPUs [35] to increase the number of calculations per node at each time step. In the case of bandwidth limited algorithms such as this, since the amount of data which must be loaded from memory doubles for double precision, the run time will be expected to approximately double.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…This is related to the fact that the algorithm is memory bandwidth limited rather than compute limited. It is well known that the use of linear finite elements with explicit time steps results in relatively few calculations being performed for a given amount of data being loaded from the memory; this is why it has been proposed to use higher order elements on GPUs [35] to increase the number of calculations per node at each time step. In the case of bandwidth limited algorithms such as this, since the amount of data which must be loaded from memory doubles for double precision, the run time will be expected to approximately double.…”
Section: Discussionmentioning
confidence: 99%
“…Two partitioning schemes are considered in this paper. The first is the simple 'greedy partitioner'; the algorithm follows that of [35]. The second is a more efficient partitioning scheme -the 'aligned partitioner' -which has been developed to subdivide the mesh into neat aligned blocks.…”
Section: Partitioningmentioning
confidence: 99%
See 1 more Smart Citation
“…A recent review of applications in FEM (Finite Element Method)-based structural mechanics can be found in [1], but GPUs are being used also in a variety of other contexts. In [2], the authors apply high-order Discontinuos Galerkin (DG) method to the solution of Maxwell's equa-tions. In DG methods most operators are defined locally at element level.…”
Section: Introductionmentioning
confidence: 99%
“…Hybrid implementations of multiscale finite element approaches, whereby constitutive material computations at Gauss point level are carried out on the GPU, are discussed in [13] and [14] In most of the above-mentioned applications, results are limited to single-precision arithmetic (e.g. [2,9,14,10]). It is important to recall, as suggested in [8], that, concerning NVIDIA GPUs, in all the implementations before CUDA compute capability 1.3, only single-precision floating point operations are supported directly by the hardware.…”
Section: Introductionmentioning
confidence: 99%