Studies on the energy and deep memory behaviour of a cache-oblivious, task-based hyperbolic PDE solver

Charrier, Dominic Etienne; Hazelwood, Benjamin; Tutlyaeva, Ekaterina; Bäder, Michael; Dumbser, Michael; Kudryavtsev, A. A.; Moskovsky, Alexander; Weinzierl, Tobias

doi:10.1177/1094342019842645

Cited by 16 publications

(23 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In general good scalability can be observed on up to 14 cores at higher orders. All ExaHyPE codes employ a hybrid parallelisation strategy with at least two MPI ranks per node [55]. Results for this hybrid parallelisation strategy are provided in Section 6.6.…”

Section: Euler Equationsmentioning

confidence: 99%

ExaHyPE: An engine for parallel dynamically adaptive simulations of wave problems

Reinarz

Charrier

Bäder

et al. 2020

Computer Physics Communications

Self Cite

View full text Add to dashboard Cite

ExaHyPE ("An Exascale Hyperbolic PDE Engine") is a software engine for solving systems of first-order hyperbolic partial differential equations (PDEs). Hyperbolic PDEs are typically derived from the conservation laws of physics and are useful in a wide range of application areas. Applications powered by ExaHyPE can be run on a student's laptop, but are also able to exploit thousands of processor cores on state-of-the-art supercomputers. The engine is able to dynamically increase the accuracy of the simulation using adaptive mesh refinement where required. Due to the robustness and shock capturing abilities of ExaHyPE's numerical methods, users of the engine can simulate linear and non-linear hyperbolic PDEs with very high accuracy.Users can tailor the engine to their particular PDE by specifying evolved quantities, fluxes, and source terms. A complete simulation code for a new hyperbolic PDE can often be realised within a few hours -a task that, traditionally, can take weeks, months, often years for researchers starting from scratch. In this paper, we showcase ExaHyPE's workflow and capabilities through real-world scenarios from our two main application areas: seismology and astrophysics.PDEs written in first order form. The systems may contain both conservative and non-conservative terms. Solution method:ExaHyPE employs the discontinuous Galerkin (DG) method combined with explicit one-step ADER (arbitrary high-order derivative) time-stepping. An a-posteriori limiting approach is applied to the ADER-DG solution, whereby spurious solutions are discarded and recomputed with a robust, patch-based finite volume scheme. ExaHyPE uses dynamical adaptive mesh refinement to enhance the accuracy of the solution around shock waves, complex geometries, and interesting features.

show abstract

Section: Euler Equationsmentioning

confidence: 99%

ExaHyPE: An engine for parallel dynamically adaptive simulations of wave problems

Reinarz

Charrier

Bäder

et al. 2020

Computer Physics Communications

Self Cite

View full text Add to dashboard Cite

show abstract

“…If we study a linear variant of (1), we integrate the cell with the Cauchy--Kowalesvki procedure [14]. Here, the STP is significantly cheaper, though it still yields localized data access [8]. The time integration following the STP allows us to reuse the outcome data structure for all intermediate-in-time results.…”

Section: C77mentioning

confidence: 99%

“…Its computations per mesh cell are arithmetically intense, which is a property they share with many higher-order methods [25]. At the same time, DG's data access pattern however is very localized [11]---this helps to reduce the memory access stress [8,17,20,24,27]---and its exchange between cells along their connecting faces is conceptually simple. A combination of these two properties---high intensity to exploit vector units and dynamic adaptive mesh refinement (AMR) to invest where it pays off most---is a fit to predictions of what exascale software will have to look like [10].…”

mentioning

confidence: 99%

Enclave Tasking for DG Methods on Dynamically Adaptive Meshes

Charrier¹,

Hazelwood²,

Weinzierl³

2020

SIAM J. Sci. Comput.

Self Cite

View full text Add to dashboard Cite

show abstract

“…This allows us to single out a failing rank. The ∆t HB ensures that the system is not flooded with heartbeat messages and is not overly sensitive to small performance fluctuations [6].…”

Section: Implementation Decisionsmentioning

confidence: 99%

TeaMPI—Replication-Based Resilience Without the (Performance) Pain

Samfass

Weinzierl

Hazelwood

et al. 2020

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

In an era where we can not afford to checkpoint frequently, replication is a generic way forward to construct numerical simulations that can continue to run even if hardware parts fail. Yet, replication often is not employed on larger scales, as naïvely mirroring a computation once effectively halves the machine size, and as keeping replicated simulations consistent with each other is not trivial. We demonstrate for the ExaHyPE engine -a task-based solver for hyperbolic equation systems -that it is possible to realise resiliency without major code changes on the user side, while we introduce a novel algorithmic idea where replication reduces the time-to-solution. The redundant CPU cycles are not burned "for nothing". Our work employs a weakly consistent data model where replicas run independently yet inform each other through heartbeat messages whether they are still up and running. Our key performance idea is to let the tasks of the replicated simulations share some of their outcomes, while we shuffle the actual task execution order per replica. This way, replicated ranks can skip some local computations and automatically start to synchronise with each other. Our experiments with a production-level seismic wave-equation solver provide evidence that this novel concept has the potential to make replication affordable for large-scale simulations in high-performance computing.

show abstract

Studies on the energy and deep memory behaviour of a cache-oblivious, task-based hyperbolic PDE solver

Cited by 16 publications

References 11 publications

ExaHyPE: An engine for parallel dynamically adaptive simulations of wave problems

ExaHyPE: An engine for parallel dynamically adaptive simulations of wave problems

Enclave Tasking for DG Methods on Dynamically Adaptive Meshes

TeaMPI—Replication-Based Resilience Without the (Performance) Pain

Contact Info

Product

Resources

About