2022
DOI: 10.48550/arxiv.2202.06236
Preprint

Efficient Natural Gradient Descent Methods for Large-Scale Optimization Problems

Abstract: We propose an efficient numerical method for computing natural gradient descent directions with respect to a generic metric in the state space. Our technique relies on representing the natural gradient direction as a solution to a standard least-squares problem. Hence, instead of calculating, storing, or inverting the information matrix directly, we apply efficient methods from numerical linear algebra to solve this least-squares problem. We treat both scenarios where the derivative of the state variable with …
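A minimal sketch of the idea summarized in the abstract: instead of forming or inverting the information matrix, the natural gradient direction is obtained from a standard least-squares solve. The sketch below is illustrative, not the authors' implementation; it assumes a Euclidean metric in the state space (so the information matrix is J^T J), and the names natural_gradient_direction, jvp, and vjp are made up for the example. It uses SciPy's LSQR driven only by Jacobian-vector products.

```python
# Sketch: natural gradient direction via a least-squares solve (assumption:
# Euclidean state-space metric, so the information matrix is G = J^T J with
# J = d(state)/d(theta)). The natural gradient d solves G d = J^T r, i.e.
# d = argmin_d ||J d - r||_2, which an iterative solver (LSQR) handles using
# only J-vector and J^T-vector products, never forming or inverting G.
import numpy as np
from scipy.sparse.linalg import LinearOperator, lsqr

def natural_gradient_direction(jvp, vjp, r, n_params):
    """jvp(v) = J @ v, vjp(w) = J.T @ w, r = residual in the state space."""
    J_op = LinearOperator((r.size, n_params), matvec=jvp, rmatvec=vjp)
    # Solve min_d ||J d - r||_2 without assembling J^T J.
    d, *_ = lsqr(J_op, r, atol=1e-8, btol=1e-8)
    return d

# Toy usage with an explicit Jacobian (illustration only).
rng = np.random.default_rng(0)
J = rng.standard_normal((200, 10))
r = rng.standard_normal(200)
d = natural_gradient_direction(lambda v: J @ v, lambda w: J.T @ w, r, 10)
# Agrees with the normal-equations solution (J^T J)^{-1} J^T r.
assert np.allclose(d, np.linalg.solve(J.T @ J, J.T @ r), atol=1e-6)
```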

Cited by 2 publications (2 citation statements)
References 25 publications
“…The adjoint-state method is an efficient technique by which we can evaluate the derivative ∂_θ J, as the computation time is largely independent of the size of θ. One can derive the adjoint-state method for gradient computations by differentiating the discrete constraint [52], which in our case is the eigenvector problem…”
Section: Gradient Calculation Through the Adjoint-State Method (mentioning)
confidence: 99%
“…It could also lead to different gradient flow formulations when passing the Fréchet derivative to the gradient, thus giving rise to different gradient descent algorithms for solving such nonconvex optimization problems. Both choices affect the convergence rate and potentially change the stationary points to which the iterative gradient-based algorithm converges, even with the same initial guess [34]. We will demonstrate this later in section 6.…”
Section: Introduction (mentioning)
confidence: 98%
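A small assumed example (not drawn from the cited works) of the point made in this statement: converting the same Fréchet derivative into a gradient under two different inner products gives two different descent directions, and hence gradient flows with different convergence behavior.

```python
# Sketch: for f(x) = 0.5 x^T Q x, the Euclidean gradient is Q x, while under
# the inner product <u, v>_M = u^T M v the gradient is M^{-1} Q x. The metric
# choice changes the descent direction and the convergence rate.
import numpy as np

Q = np.diag([100.0, 1.0])            # ill-conditioned quadratic
M = Q.copy()                         # metric chosen to match the curvature
x0 = np.array([1.0, 1.0])
x_euc, x_met = x0.copy(), x0.copy()
step = 5e-3

for _ in range(100):
    x_euc = x_euc - step * (Q @ x_euc)                    # Euclidean descent
    x_met = x_met - step * np.linalg.solve(M, Q @ x_met)  # metric-weighted descent

print("Euclidean:", x_euc)  # very uneven progress across coordinates
print("Metric   :", x_met)  # same contraction rate in every direction
```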