2020
DOI: 10.1109/access.2020.2975391
|View full text |Cite
|
Sign up to set email alerts
|

Data-Driven Nonlinear Near-Optimal Regulation Based on Multi-Dimensional Taylor Network Dynamic Programming

Abstract: Using the data-driven control formulation, an iterative dynamic programming approach which is based on a multi-dimensional Taylor network is established to design the near optimal regulation of discrete-time nonlinear systems. For discrete-time general nonlinear systems, the iterative adaptive dynamic programming algorithm is developed and proved to guarantee the property of convergence and optimality. Three networks are constructed, namely, the identification network, critic network and action network. Moreov… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 42 publications
0
4
0
Order By: Relevance
“…There are some well-known approaches to solve the MDP problems such as dynamic programming [29]. Unfortunately, these approaches depend on perfect knowledge of the dynamic variation of environment to a great extent.…”
Section: S )mentioning
confidence: 99%
“…There are some well-known approaches to solve the MDP problems such as dynamic programming [29]. Unfortunately, these approaches depend on perfect knowledge of the dynamic variation of environment to a great extent.…”
Section: S )mentioning
confidence: 99%
“…This part introduces the online PI algorithm [43] [44] to solve the HJB equation. The online PI algorithm is composed of the policy evaluation (26) and the policy improvement (27).…”
Section: Online Policy Iteration Algorithmmentioning
confidence: 99%
“…As an appropriate application for solving the optimal control problems of nonlinear systems, ADP algorithm, which was first proposed by Werbos [21], was considered as an effective approach in avoiding the difficulties of the 'curse of dimensionality'. Today, the ADP-based methods are being utilized in the designing of continue-time [22] [23], discrete time [24] [25] [26], data driven-based [27] [28] intelligence systems, and the solution of nonlinear optimal control with input/output constraints [29] [30], external disturbances [31] [32] [33] and actuator failures [34] [35]. Several investigations studied, the optimal control problems of the robot manipulator systems based on the ADP approach.…”
Section: Introductionmentioning
confidence: 99%
“…Life is full of digital signals and digital technology. As the main bad environment space of our life, urban landscape is inevitably a ected and impacted by digital technology with its high sensitivity to science and technology [2].…”
Section: Introductionmentioning
confidence: 99%