Communication Topology Co-Design in Graph Recurrent Neural Network Based Distributed Control

Yang, Fengjun; Matni, Nikolai

doi:10.48550/arxiv.2104.13868

Cited by 3 publications

(5 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this subsection, we consider utilizing sparse weight matrices to enable distributed implementations of H-DNNs. Sparsity structures in neural networks can also be used to encode prior information on relations among elements when learning graph data [23] or to perform distributed control tasks [24], [25].…”

Section: Distributed Learning Through H-dnnsmentioning

confidence: 99%

“…It is therefore important to develop large-scale DNN models for which the training can be distributed between physically separated end devices while guaranteeing satisfactory system-wide predictions. Furthermore, distributed DNN architectures enhance data privacy and fault tolerance [21], facilitate the learning from graph inputs [23] and enable the execution of distributed control tasks [24], [25].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Hamiltonian Deep Neural Networks Guaranteeing Non-vanishing Gradients by Design

Galimberti¹,

Furieri²,

Li³

et al. 2021

Preprint

View full text Add to dashboard Cite

Deep Neural Networks (DNNs) training can be difficult due to vanishing and exploding gradients during weight optimization through backpropagation. To address this problem, we propose a general class of Hamiltonian DNNs (H-DNNs) that stem from the discretization of continuous-time Hamiltonian systems and include several existing architectures based on ordinary differential equations. Our main result is that a broad set of H-DNNs ensures non-vanishing gradients by design for an arbitrary network depth. This is obtained by proving that, using a semiimplicit Euler discretization scheme, the backward sensitivity matrices involved in gradient computations are symplectic. We also provide an upper bound to the magnitude of sensitivity matrices, and show that exploding gradients can be either controlled through regularization or avoided for special architectures. Finally, we enable distributed implementations of backward and forward propagation algorithms in H-DNNs by characterizing appropriate sparsity constraints on the weight matrices. The good performance of H-DNNs is demonstrated on benchmark classification problems, including image classification with the MNIST dataset.

show abstract

Section: Distributed Learning Through H-dnnsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Hamiltonian Deep Neural Networks Guaranteeing Non-vanishing Gradients by Design

Galimberti¹,

Furieri²,

Li³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…Unfortunately, these assumptions do not hold for the vast majority of real-world large-scale systems. This fact motivates parametrizing the functions χ i (•), π i (•) as highly nonlinear deep neural networks, even when the dynamics (1) are linear (Gama and Sojoudi, 2021;Yang and Matni, 2021).…”

Section: Problem Statementmentioning

confidence: 99%

“…These limitations motivate going beyond linear control and suggest parametrizing highly nonlinear distributed policies through Deep Neural Networks (DNNs). Specifically, the recent works ; Khan et al (2020); Gama and Sojoudi (2021); Yang and Matni (2021) have focused on training Graph Neural Networks (GNNs) that parametrize static and dynamical distributed control policies. These methods achieve remarkable performance in applications such as vehicle flocking and formation flying.…”

Section: Introductionmentioning

confidence: 99%

Distributed neural network control with dependability guarantees: a compositional port-Hamiltonian approach

Furieri¹,

Galimberti²,

Zakwan³

et al. 2021

Preprint

View full text Add to dashboard Cite

1 Large-scale cyber-physical systems require that control policies are distributed, that is, that they only rely on local real-time measurements and communication with neighboring agents. Optimal Distributed Control (ODC) problems are, however, highly intractable even in seemingly simple cases. Recent work has thus proposed training Neural Network (NN) distributed controllers. A main challenge of NN controllers is that they are not dependable during and after training, that is, the closed-loop system may be unstable, and the training may fail due to vanishing and exploding gradients. In this paper, we address these issues for networks of nonlinear port-Hamiltonian (pH) systems, whose modeling power ranges from energy systems to non-holonomic vehicles and chemical reactions. Specifically, we embrace the compositional properties of pH systems to characterize deep Hamiltonian control policies with built-in closed-loop stability guarantees -irrespective of the interconnection topology and the chosen NN parameters. Furthermore, our setup enables leveraging recent results on well-behaved neural ODEs to prevent the phenomenon of vanishing gradients by design. Numerical experiments corroborate the dependability of the proposed architecture, while matching the performance of general neural network policies.

show abstract

“…Related work DNNs have shown promise in designing both static and dynamic distributed control policies for largescale systems. Notably, Graph Neural Networks (GNNs) have achieved impressive performance in applications like vehicle flocking and formation flying [24]- [27] thanks to their inherent scalable structure. However, guaranteeing stability with general GNNs remains challenging, often requiring restrictive assumptions like linear, open-loop stable system dynamics or sufficiently small Lipschitz constants [27].…”

Section: Introductionmentioning

confidence: 99%

Universal Approximation Property of Hamiltonian Deep Neural Networks

Zakwan

d’Angelo

Ferrari-Trecate

2023

IEEE Control Syst. Lett.

View full text Add to dashboard Cite

Controlling large-scale cyber-physical systems necessitates optimal distributed policies, relying solely on local real-time data and limited communication with neighboring agents. However, finding optimal controllers remains challenging, even in seemingly simple scenarios. Parameterizing these policies using Neural Networks (NNs) can deliver good performance, but their sensitivity to small input changes can destabilize the closed-loop system. This paper addresses this issue for a network of nonlinear dissipative systems. Specifically, we leverage well-established port-Hamiltonian structures to characterize deep distributed control policies with closed-loop stability guarantees and a finite L2 gain, regardless of specific NN parameters. This eliminates the need to constrain the parameters during optimization and enables training with standard methods like stochastic gradient descent. A numerical study on the consensus control of Kuramoto oscillators demonstrates the effectiveness of the proposed controllers.

show abstract

Communication Topology Co-Design in Graph Recurrent Neural Network Based Distributed Control

Cited by 3 publications

References 11 publications

Hamiltonian Deep Neural Networks Guaranteeing Non-vanishing Gradients by Design

Hamiltonian Deep Neural Networks Guaranteeing Non-vanishing Gradients by Design

Distributed neural network control with dependability guarantees: a compositional port-Hamiltonian approach

Universal Approximation Property of Hamiltonian Deep Neural Networks

Contact Info

Product

Resources

About