2018
DOI: 10.1007/978-3-319-78054-2_1
|View full text |Cite
|
Sign up to set email alerts
|

An Experience Report on (Auto-)tuning of Mesh-Based PDE Solvers on Shared Memory Systems

Abstract: With the advent of manycore systems, shared memory parallelisation has gained importance in high performance computing. Once a code is decomposed into tasks or parallel regions, it becomes crucial to identify reasonable grain sizes, i.e. minimum problem sizes per task that make the algorithm expose a high concurrency at low overhead. Many papers do not detail what reasonable task sizes are, and consider their findings craftsmanship not worth discussion. We have implemented an autotuning algorithm, a machine le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(8 citation statements)
references
References 7 publications
0
8
0
Order By: Relevance
“…Second, we have to continue to investigate and to invest into the methods under the hood of ExaHyPE. Examples for such new ingredients on our agenda are accelerator support, local time stepping, and more dynamic load balancing and autotuning [67,68]. Finally, we plan extensions of the core paradigm of the engine.…”
Section: Discussionmentioning
confidence: 99%
“…Second, we have to continue to investigate and to invest into the methods under the hood of ExaHyPE. Examples for such new ingredients on our agenda are accelerator support, local time stepping, and more dynamic load balancing and autotuning [67,68]. Finally, we plan extensions of the core paradigm of the engine.…”
Section: Discussionmentioning
confidence: 99%
“…Extensions of the sole grid and its traversal are available via a template mechanism and small routine collections (toolkits) that inject plotting facilities for Paraview/VisIt, realize shared memory autotuning [23] or add dynamic load balancing based upon graph partitioning or the underlying Peano space-filling curve. Further examples for toolkits are routine collections for matrix-free multigrid [56,73], Particle-in-Cell features [78] similar to [45] or patch-based PDE solvers [75].…”
Section: A Software Basementioning
confidence: 99%
“…Peano offers a plug-in point to inject proper problem size dependent into any application. If a manual identification is too cumbersome, then its toolbox collection provides a generic machine-learning algorithm to derive proper grain size choices onthe-fly (Charrier and Weinzierl 2017).…”
Section: Concurrency Impact Of the Dfs-bfs Transformationmentioning
confidence: 99%
“…Extensions of the sole grid and its traversal are available via a template mechanism and small routine collections (toolkits) that inject plotting facilities for Paraview/VisIt, realize shared memory autotuning (Charrier and Weinzierl 2017) or add dynamic load balancing based upon graph partitioning or the underlying Peano space-filling curve. Further examples for toolkits are routine collections for matrix-free multigrid (Reps and Weinzierl 2017; Weinzierl and Weinzierl 2018), Particle-in-Cell features (Weinzierl et al 2016) similar to (Kolobov and Arslanbekov 2016) or patchbased PDE solvers (Weinzierl et al 2014).…”
Section: Appendices a Software Basementioning
confidence: 99%