2021
DOI: 10.1109/access.2021.3051984
|View full text |Cite
|
Sign up to set email alerts
|

A Nearer Optimal and Faster Trained Value Iteration ADP for Discrete-Time Nonlinear Systems

Abstract: Adaptive dynamic programming (ADP) is generally implemented using three neural networks: model network, action network, and critic network. In the conventional works of the value iteration ADP, the model network is initialized randomly and trained by the backpropagation algorithm, whose results are easy to get trapped in a local minimum; both the critic network and action network are trained in each outer-loop, which is time-consuming. To approximate the optimal control policy more accurately and decrease the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 32 publications
0
1
0
Order By: Relevance
“…Optimal control method has been proposed nearly 70 years ago (Bellman and Dreyfus, 2013; Pontryagin, 1959), and plenty of literatures are recorded (An et al, 2021; An et al, 2021; Wei et al, 2021). Adaptive dynamic programming (ADP) theory is a key direction to undertake approximate optimal control system issues of discrete-time (Hu et al, 2021; Luo et al, 2020; Zhu et al, 2020a), continuous-time (An et al, 2021; Shan et al, 2020; Wei et al, 2020), data driven (Gao et al, 2018; Li et al, 2020; Su et al, 2020), and furthermore robot systems with input/output constraints (An et al, 2020; Lu et al, 2020; Ren et al, 2019), external disturbance (An et al, 2019; Song and Lewis, 2020; Xia et al, 2020), actuator failures (Dai et al, 2018; Jiao et al, 2018; Ma et al, 2020), and so on. Kong et al (2021b) developed n -link manipulator approximate optimal law via saturation nonlinearity.…”
Section: Introductionmentioning
confidence: 99%
“…Optimal control method has been proposed nearly 70 years ago (Bellman and Dreyfus, 2013; Pontryagin, 1959), and plenty of literatures are recorded (An et al, 2021; An et al, 2021; Wei et al, 2021). Adaptive dynamic programming (ADP) theory is a key direction to undertake approximate optimal control system issues of discrete-time (Hu et al, 2021; Luo et al, 2020; Zhu et al, 2020a), continuous-time (An et al, 2021; Shan et al, 2020; Wei et al, 2020), data driven (Gao et al, 2018; Li et al, 2020; Su et al, 2020), and furthermore robot systems with input/output constraints (An et al, 2020; Lu et al, 2020; Ren et al, 2019), external disturbance (An et al, 2019; Song and Lewis, 2020; Xia et al, 2020), actuator failures (Dai et al, 2018; Jiao et al, 2018; Ma et al, 2020), and so on. Kong et al (2021b) developed n -link manipulator approximate optimal law via saturation nonlinearity.…”
Section: Introductionmentioning
confidence: 99%