2014
DOI: 10.1109/tcyb.2014.2313915
|View full text |Cite
|
Sign up to set email alerts
|

Online Adaptive Policy Learning Algorithm for $H_{\infty }$ State Feedback Control of Unknown Affine Nonlinear Discrete-Time Systems

Abstract: The problem of H∞ state feedback control of affine nonlinear discrete-time systems with unknown dynamics is investigated in this paper. An online adaptive policy learning algorithm (APLA) based on adaptive dynamic programming (ADP) is proposed for learning in real-time the solution to the Hamilton-Jacobi-Isaacs (HJI) equation, which appears in the H∞ control problem. In the proposed algorithm, three neural networks (NNs) are utilized to find suitable approximations of the optimal value function and the saddle … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
62
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
9
1

Relationship

1
9

Authors

Journals

citations
Cited by 209 publications
(62 citation statements)
references
References 47 publications
0
62
0
Order By: Relevance
“…Finally, a class of practical large-scale nonlinear system in the area of power systems has been employed to demonstrate the effectiveness of the proposed control approaches. One interesting future research topic is the extension of the proposed methods to adaptive policy learning algorithm as shown in [62] for the large-scale fuzzy systems with event-triggering control.…”
Section: Fig 2 Membership Functionsmentioning
confidence: 99%
“…Finally, a class of practical large-scale nonlinear system in the area of power systems has been employed to demonstrate the effectiveness of the proposed control approaches. One interesting future research topic is the extension of the proposed methods to adaptive policy learning algorithm as shown in [62] for the large-scale fuzzy systems with event-triggering control.…”
Section: Fig 2 Membership Functionsmentioning
confidence: 99%
“…In [15][16][17][18][19], the integral reinforcement learning algorithm was used to solve optimal control problems of continuous-time nonlinear systems. Hamilton-Jacobi-Bellman (HJB) equation is one of the important keys in optimal control problems [14,20]. In linear time-invariant systems, the HJB becomes the Riccati equation, which can be solved effectively.…”
Section: Introductionmentioning
confidence: 99%
“…With the development of data acquisition and storage technology, high-dimensional data widely exist in nature [1], finance [2], industry [3][4][5], biomedicine [6][7][8] and many other fields, which contain complicated nonlinear relationship among multiple features. Finding potential useful information and building prediction model from high-dimensional data have become one of the most important aspects of data mining and knowledge discovery [9].…”
Section: Introductionmentioning
confidence: 99%