2020
DOI: 10.48550/arxiv.2006.02361
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Optimizing Neural Networks via Koopman Operator Theory

Abstract: Koopman operator theory, a powerful framework for discovering the underlying dynamics of nonlinear dynamical systems, was recently shown to be intimately connected with neural network training. In this work, we take the first steps in making use of this connection. As Koopman operator theory is a linear theory, a successful implementation of it in evolving network weights and biases offers the promise of accelerated training, especially in the context of deep networks, where optimization is inherently a non-co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 25 publications
0
2
0
Order By: Relevance
“…Another way to apply Koopman operator theory to neural networks is to consider the learning process of weights as a dynamical system. For example, in [23], the Koopman operator is constructed from the early-stage learning trajectory of weights. Then, the use of the constructed Koopman operator in the subsequent optimization process significantly reduces the learning time.…”
Section: J Stat Mech (2024) 073401mentioning
confidence: 99%
“…Another way to apply Koopman operator theory to neural networks is to consider the learning process of weights as a dynamical system. For example, in [23], the Koopman operator is constructed from the early-stage learning trajectory of weights. Then, the use of the constructed Koopman operator in the subsequent optimization process significantly reduces the learning time.…”
Section: J Stat Mech (2024) 073401mentioning
confidence: 99%
“…Beyond the computational speed and the reduction in feedback complexity, the linear representation-based control could lead to better performance compared to a controller that is based on the original nonlinear system [32]. The Koopman operator can also be readily combined with machine learning tools to help learn unknown dynamics from data [33]- [43].…”
Section: Introductionmentioning
confidence: 99%