Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2017
DOI: 10.1145/3126908.3126920
|View full text |Cite
|
Sign up to set email alerts
|

Tessellating stencils

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
38
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(38 citation statements)
references
References 68 publications
0
38
0
Order By: Relevance
“…Most of these references rely on SB with an implementation based on the BSP model. To increase data locality, TB through tiling techniques (Bandishti et al, 2012;Grosser et al, 2014b;Malas et al, 2015;Orozco and Gao, 2009;Strzodka et al, 2011;Wellein et al, 2009;Wonnacott, 2000;Yuan et al, 2017;Zhou, 2013) has been widely considered using various advanced programming models to favor asynchronous execution. Performance tuning using roofline models (Datta, 2009;Etienne et al, 2017;Nguyen et al, 2010;Titarenko and Hildyard, 2017) remains an important assessment step for stencil computations to ensure a good utilization of the underlying hardware resources.…”
Section: Prior Work and Current Contributionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Most of these references rely on SB with an implementation based on the BSP model. To increase data locality, TB through tiling techniques (Bandishti et al, 2012;Grosser et al, 2014b;Malas et al, 2015;Orozco and Gao, 2009;Strzodka et al, 2011;Wellein et al, 2009;Wonnacott, 2000;Yuan et al, 2017;Zhou, 2013) has been widely considered using various advanced programming models to favor asynchronous execution. Performance tuning using roofline models (Datta, 2009;Etienne et al, 2017;Nguyen et al, 2010;Titarenko and Hildyard, 2017) remains an important assessment step for stencil computations to ensure a good utilization of the underlying hardware resources.…”
Section: Prior Work and Current Contributionsmentioning
confidence: 99%
“…Christen et al, 2011;Malas, 2015;Tang et al, 2011). More thorough related work can be found at Malas et al (2017) and Yuan et al (2017).…”
Section: Prior Work and Current Contributionsmentioning
confidence: 99%
“…With the development of advanced vector instruction sets, there have been many research studies addressing the challenges of a faster computational stencil kernels on homogeneous x86 and GPU-based systems using spatial [5]- [15] or temporal blocking [16]- [26]. These stencil kernel optimizations are key components to the Reverse Time Migration (RTM), but they usually rely on simple boundary conditions (e.g., Dirichlet) and do not consider the full RTM ecosystem and specifications.…”
Section: Related Workmentioning
confidence: 99%
“…It is extensively involved in various domains from physical simulations to machine learning [8,26,36]. Stencil is also included as one of the seven computational motifs presented in the Berkeley View [3,4,51] and arises as a principal class of floating-point kernels in high-performance computing.…”
Section: Introductionmentioning
confidence: 99%
“…The naive implementation for a 𝑑-dimensional stencil contains 𝑑 +1 loops where the time dimension is traversed in the outmost loop and all grid points are updated in inner loops. Since stencil is characterized by this regular computational structure, it is inherently a bandwidth-bound kernel with a low arithmetic intensity and poor data reuse [24,51].…”
Section: Introductionmentioning
confidence: 99%