2021
DOI: 10.48550/arxiv.2107.14742
Preprint

Connections between Numerical Algorithms for PDEs and Neural Networks

Abstract: We investigate numerous structural connections between numerical algorithms for partial differential equations (PDEs) and neural architectures. Our goal is to transfer the rich set of mathematical foundations from the world of PDEs to neural networks. Besides structural insights, we provide concrete examples and experimental evaluations of the resulting architectures. Using the example of generalised nonlinear diffusion in 1D, we consider explicit schemes, acceleration strategies thereof, implicit schemes, and …

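The abstract's starting point, an explicit scheme for generalised nonlinear diffusion in 1D, already carries the residual structure found in neural networks: each step has the form u + τ·F(u), i.e. a skip connection. Below is a minimal sketch assuming a Perona-Malik-type diffusivity and reflecting boundary conditions; both choices are illustrative and not taken from the paper.

```python
import numpy as np

def diffusivity(s2, lam=1.0):
    # Perona-Malik-type diffusivity g(s^2); one common choice, used here
    # purely for illustration.
    return 1.0 / (1.0 + s2 / lam**2)

def explicit_step(u, tau=0.2, h=1.0, lam=1.0):
    # One explicit step of 1D nonlinear diffusion
    #   u^{k+1} = u^k + tau * d/dx( g(|u_x|^2) * u_x ),
    # discretised with finite differences and reflecting boundaries.
    # Structurally this is u + tau * F(u), i.e. a residual (skip) connection.
    du = np.diff(u) / h                       # forward differences at half grid points
    g = diffusivity(du**2, lam)               # diffusivity at half grid points
    flux = g * du                             # nonlinear flux g(u_x^2) * u_x
    div = np.diff(np.concatenate(([0.0], flux, [0.0]))) / h   # divergence, reflecting BCs
    return u + tau * div

# usage: smooth a noisy 1D signal
u = np.sin(np.linspace(0, 2 * np.pi, 128)) + 0.1 * np.random.randn(128)
for _ in range(50):
    u = explicit_step(u)
```

With diffusivities bounded by 1, the explicit step is only stable for τ ≤ h²/2; relaxing this restriction is what the acceleration strategies mentioned in the abstract are about.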
Cited by 4 publications (6 citation statements)
References 82 publications (162 reference statements)
“…Although FSI requires an additional computation of the sum of two vectors, the computational costs are lower than for FED because non-varying time step sizes are used within the matrix-vector multiplications in (34), meaning I + τL must be computed only once. Another benefit of FSI is its straightforward use for nonlinear problems L(u^k), where the scheme allows us to perform nonlinear updates within one cycle.…”
Section: Fast Semi-Iterative Diffusion
confidence: 99%
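For reference, the quoted fast semi-iterative (FSI) update can be written out in a few lines. The sketch below treats a linear problem du/dt = Lu and assumes the extrapolation weights α_l = (4l+2)/(2l+3) from the FSI literature; the weights are not stated in the quote itself.

```python
import numpy as np

def fsi_cycle(u, L, tau, n):
    # One FSI cycle with n inner steps for du/dt = L u:
    #   u^{l+1} = alpha_l * (I + tau*L) u^l + (1 - alpha_l) * u^{l-1},
    #   alpha_l = (4l + 2) / (2l + 3).
    # As in the quoted statement, I + tau*L is assembled only once per cycle,
    # and each inner step merely adds an extrapolation term (a sum of two vectors).
    A = np.eye(len(u)) + tau * L
    u_prev = u.copy()
    for l in range(n):
        alpha = (4.0 * l + 2.0) / (2.0 * l + 3.0)
        u_new = alpha * (A @ u) + (1.0 - alpha) * u_prev
        u_prev, u = u, u_new
    return u
```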
“…However, the schemes can be applied to many parabolic problems, including time-dependent boundary conditions, also in an engineering context as demonstrated in this work. Beyond this, FSI can also be translated to neural architectures and thus offers a high level of practical relevance; we refer the interested reader to the current work [34].…”
Section: Fast Semi-Iterative Diffusion
confidence: 99%
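The "translation to neural architectures" mentioned in the quote can be illustrated by reading each FSI step as a layer with two skip connections, where a learnable convolution takes the role of τL. The class below is a hypothetical PyTorch sketch, not the architecture of the cited work; all names and the 1D setting are assumptions.

```python
import torch.nn as nn

class FSIBlock(nn.Module):
    # Sketch: the FSI update rewritten as a network layer.  The operator
    # (I + tau*L) becomes identity plus a learnable convolution, and the
    # extrapolation becomes a weighted skip connection to the state before
    # the previous one.
    def __init__(self, channels, alpha):
        super().__init__()
        self.conv = nn.Conv1d(channels, channels, kernel_size=3, padding=1)
        self.alpha = alpha

    def forward(self, u, u_prev):
        # u, u_prev: tensors of shape (batch, channels, length)
        u_new = self.alpha * (u + self.conv(u)) + (1.0 - self.alpha) * u_prev
        return u_new, u   # caller feeds (u_new, u) into the next block
```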
“…Another approach considers a neural network as an operator between Euclidean spaces of the same dimension, depending on the discretization of the PDE [29,30,31,32,33]. This approach depends on the discretization and requires modifying the architecture of the network when the discrete resolution or the discretization changes; • Most neural network architectures can be interpreted as numerical schemes [34,35]. Neural networks can therefore be seen as operators acting between infinite-dimensional spaces (typically spaces of functions): for instance, for a time-dependent PDE, the forward propagation of an associated neural network can be viewed as the flow associated to the PDE when a time step δt is fixed.…”
Section: Introduction
confidence: 99%
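The flow interpretation in the last sentence of the quote is easy to make concrete: with a fixed time step δt, a residual network u^{k+1} = u^k + δt·f_k(u^k) is the explicit Euler discretisation of the flow du/dt = f(u, t). A minimal illustrative sketch, with all function and variable names assumed:

```python
import numpy as np

def resnet_as_euler_flow(u0, layers, dt):
    # `layers` is a list of functions f_k; each residual step
    # u <- u + dt * f_k(u) is one explicit Euler step of du/dt = f(u, t).
    u = u0
    for f in layers:
        u = u + dt * f(u)
    return u

# toy usage: ten identical "layers" realise the linear flow du/dt = -u,
# so the output is approximately exp(-1) times the input
layers = [lambda u: -u] * 10
u_final = resnet_as_euler_flow(np.ones(4), layers, dt=0.1)
```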
“…We show that by integrating TV smoothing steps into existing network architectures (as a non-pointwise activation), we improve the performance of CNNs in classification and semantic segmentation tasks. A related TV approach was suggested very recently in [2], albeit not in the context of quantization, and using a learnt activation function. Next, we examine the behaviour of quantization under symmetric and stable, heat equation-like CNNs [32,1,2].…”
Section: Introduction
confidence: 99%
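As a rough illustration of a TV smoothing step used as a non-pointwise activation, the function below runs a few explicit descent steps on a smoothed total variation of each 1D feature vector; unlike ReLU, it couples neighbouring entries. This is a sketch under its own assumptions, not the activation proposed in the cited work.

```python
import numpy as np

def tv_activation(u, lam=0.1, steps=5, eps=1e-6):
    # A few explicit descent steps on the smoothed total variation
    #   E(u) = sum_i sqrt((u_{i+1} - u_i)^2 + eps).
    # Each step adds the (negative) TV gradient, which acts like a
    # signal-adaptive smoothing of u rather than a pointwise nonlinearity.
    u = u.copy()
    for _ in range(steps):
        du = np.diff(u)
        g = du / np.sqrt(du**2 + eps)                  # subgradient of |u_x|
        div = np.diff(np.concatenate(([0.0], g, [0.0])))
        u = u + lam * div                              # one descent step on TV
    return u
```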
“…We show that the quantization process produces significantly lighter-weight networks, in terms of storage and computation, while only incurring a minimal loss of accuracy.…”
Section: Introduction
confidence: 99%