2020
DOI: 10.1145/3374916
|View full text |Cite
|
Sign up to set email alerts
|

Architecture and Performance of Devito, a System for Automated Stencil Computation

Abstract: Stencil computations are a key part of many high-performance computing applications, such as image processing, convolutional neural networks, and finite-difference solvers for partial differential equations. Devito is a framework capable of generating highly optimized code given symbolic equations expressed in Python , specialized in, but not limited to, affine (stencil) codes. The lowering process—from mathematical equations down to C++ code—is performed by the Devito compiler through … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
76
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 102 publications
(78 citation statements)
references
References 23 publications
2
76
0
Order By: Relevance
“…Compared to earlier work, addressing the memory demands of (LS‐)RTM via on‐fly‐Fourier transforms (Witte et al ., 2019c), we tackle the problem of on‐the‐fly source estimation in the time domain. Because we use industry‐strength time‐domain finite‐difference propagators provided by Devito (Luporini et al ., 2018; Louboutin et al ., 2019) and exposed in the Julia programming language by JUDI (Witte et al ., 2019b), our approach scales in principle to large 3D industrial problems. While we stressed the importance of estimating the source function, we believe that the sensitivity of least‐squares RTM (LS‐RTM) to errors in the background velocity model needs to be studied more carefully albeit early work on time‐harmonic LS‐RTM shows robustness with respect to these errors (Tu and Herrmann, 2015a).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Compared to earlier work, addressing the memory demands of (LS‐)RTM via on‐fly‐Fourier transforms (Witte et al ., 2019c), we tackle the problem of on‐the‐fly source estimation in the time domain. Because we use industry‐strength time‐domain finite‐difference propagators provided by Devito (Luporini et al ., 2018; Louboutin et al ., 2019) and exposed in the Julia programming language by JUDI (Witte et al ., 2019b), our approach scales in principle to large 3D industrial problems. While we stressed the importance of estimating the source function, we believe that the sensitivity of least‐squares RTM (LS‐RTM) to errors in the background velocity model needs to be studied more carefully albeit early work on time‐harmonic LS‐RTM shows robustness with respect to these errors (Tu and Herrmann, 2015a).…”
Section: Discussionmentioning
confidence: 99%
“…To keep our time‐domain wave‐equation solvers with finite differences numerically stable (in our implementation, we used Devito, https://www.devitoproject.org) for our time‐domain finite‐difference simulations and gradient computations (Luporini et al ., 2018; Louboutin et al ., 2019) and JUDI (https://github.com/slimgroup/JUDI.jl) as an abstract linear algebra interface to our algorithms (Witte et al ., 2019b), we introduce an initial guess for the source‐time function q0 with a bandwidth‐limited spectrum that is flat over the frequency range of interest. Under some assumptions on the source‐time function, we can write the true source‐time function as the convolution between the initial guess and the unknown filter boldw – that is, we have q=w*boldq0 where the symbol * denotes the temporal convolution.…”
Section: Methodsmentioning
confidence: 99%
“…Reference Problem We use Devito (Kukreja et al, 2016;Luporini et al, 2018) to build an acoustic wave propagation experiment. The velocity model was initialized using the SEG Overthrust model.…”
Section: Methodsmentioning
confidence: 99%
“…There have been few attempts to provide a modular framework for the stencil computations. Devito [29], [30] generates optimized wave propagation kernels for use in seismic imaging. However, Devito relies on the fork-join paradigm from the OpenMP programming model [31] and SIMD optimizations for parallel performance.…”
Section: Related Workmentioning
confidence: 99%