2021
DOI: 10.1049/cth2.12102
|View full text |Cite
|
Sign up to set email alerts
|

Learning near‐optimal broadcasting intervals in decentralized multi‐agent systems using online least‐square policy iteration

Abstract: Here, agents learn how often to exchange information with neighbours in cooperative multi-agent systems (MASs) such that their linear quadratic regulator (LQR)-like performance indices are minimized. The investigated LQR-like cost functions capture trade-offs between the energy consumption of each agent and MAS local control performance in the presence of exogenous disturbances, delayed and noisy data. Agent energy consumption is critical for prolonging the MAS mission and is composed of both control (e.g. acc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 53 publications
0
1
0
Order By: Relevance
“…Modern optimal control theory has a rich set of analytical tools to design control strategies satisfying desirable characteristics of the excursion of the system states according to the designer's specifications in an optimal manner [19]. The LQR is one such design methodology whereby quadratic performance indices involving the control signal and the state variables are minimized in an optimal fashion [20][21][22][23]. It has a very nice stability property, that is, if the process is of single-input and single-output, then the control system has at least the phase margin of 60 • and the gain margin of infinity [24].…”
Section: Introductionmentioning
confidence: 99%
“…Modern optimal control theory has a rich set of analytical tools to design control strategies satisfying desirable characteristics of the excursion of the system states according to the designer's specifications in an optimal manner [19]. The LQR is one such design methodology whereby quadratic performance indices involving the control signal and the state variables are minimized in an optimal fashion [20][21][22][23]. It has a very nice stability property, that is, if the process is of single-input and single-output, then the control system has at least the phase margin of 60 • and the gain margin of infinity [24].…”
Section: Introductionmentioning
confidence: 99%