2022
DOI: 10.1002/rnc.6169
|View full text |Cite
|
Sign up to set email alerts
|

Output‐feedback Q‐learning for discrete‐time linear H tracking control: A Stackelberg game approach

Abstract: In this article, an output‐feedback Q‐learning algorithm is proposed for the discrete‐time linear system to deal with the H∞$$ {H}_{\infty } $$ tracking control problem. The problem is formulated as a zero‐sum game in the Stackelberg game framework with a discount factor to make the value function bounded. According to the principle of optimality, the game algebraic Riccati equation (GARE) is derived and solved by the Q‐learning algorithm to get the optimal solution of the Stackelberg game without requiring th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 54 publications
(114 reference statements)
0
4
0
Order By: Relevance
“…For example, reference 8 studied the output tracking control problem of Hamiltonian descriptor systems with input saturation and delay. Moreover, reference 9 developed an output‐feedback Q‐learning algorithm for discrete‐time linear systems to solve the problem of tracking control.…”
Section: Introductionmentioning
confidence: 99%
“…For example, reference 8 studied the output tracking control problem of Hamiltonian descriptor systems with input saturation and delay. Moreover, reference 9 developed an output‐feedback Q‐learning algorithm for discrete‐time linear systems to solve the problem of tracking control.…”
Section: Introductionmentioning
confidence: 99%
“…[24][25][26] Q-learning algorithm has also been proposed to solve the game algebraic Riccati equation (GARE) of discrete-time linear systems. [27][28][29][30][31] The solution of the GARE leads to the realization of H ∞ tracking control.…”
Section: Introductionmentioning
confidence: 99%
“…By combining the Q‐function, RL methods can achieve model‐free optimal control of CT or discrete‐time (DT) systems 24‐26 . Q‐learning algorithm has also been proposed to solve the game algebraic Riccati equation (GARE) of discrete‐time linear systems 27‐31 . The solution of the GARE leads to the realization of H$$ {H}_{\infty } $$ tracking control.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations