2021
DOI: 10.48550/arxiv.2103.11572
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

D3PI: Data-Driven Distributed Policy Iteration for Homogeneous Interconnected Systems

Abstract: Control of large-scale networked systems often necessitates the availability of complex models for the interactions amongst the agents. While building accurate models of these interactions could become prohibitive in many applications, data-driven control methods can circumvent model complexities by directly synthesizing a controller from the observed data. In this paper, we propose the Data-Driven Distributed Policy Iteration (D3PI) algorithm to design a feedback mechanism for a potentially large system that … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 51 publications
0
2
0
Order By: Relevance
“…However, the amount of data that is required by the algorithm scales proportionally with the size of the network. In [16], it assumed that "auxiliary" links among subsystems exist, which are then exploited to solve the LQR problem with unknown dynamics and local communication; this assumption however makes its extension to an online approach (e.g., MPC) computionally unappealing. Moreover, while there exist works that guarantee stability in centralized data-driven MPC settings [9], it is unclear how such techniques can be adapted to provide theoretical guarantees in the distributed setting.…”
Section: Introductionmentioning
confidence: 99%
“…However, the amount of data that is required by the algorithm scales proportionally with the size of the network. In [16], it assumed that "auxiliary" links among subsystems exist, which are then exploited to solve the LQR problem with unknown dynamics and local communication; this assumption however makes its extension to an online approach (e.g., MPC) computionally unappealing. Moreover, while there exist works that guarantee stability in centralized data-driven MPC settings [9], it is unclear how such techniques can be adapted to provide theoretical guarantees in the distributed setting.…”
Section: Introductionmentioning
confidence: 99%
“…Among these, Fattahi et al (2020) demonstrates a first use case of SLS theory in learning-based distributed controller learning. However, most prior work that considers distributed learning and control schemes use the stochastic noise or no-noise model, assume a known stabilizing distributed controller is given, and cannot handle general communication delay during learning, e.g., Li et al (2021b); Alonso et al (2021); Jing et al (2021); Alemzadeh and Mesbahi (2019); Talebi et al (2021a); Alemzadeh et al (2021). Ho and Doyle (2019) presents an adaptive SLS controller but requires a known stabilizing controller and does not have guaranteed stability for large uncertainties.…”
Section: Introductionmentioning
confidence: 99%