2021
DOI: 10.1007/s00521-021-05909-8
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning-based nonlinear tracking control system design via LDI approach with application to trolley system

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
4
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 29 publications
0
4
0
Order By: Relevance
“…An online actor‐critic algorithm is proposed to solve the continuous‐time infinite horizon optimal control problem in Reference 26. A novel algorithm for the nonlinear tracking problem is designed in Reference 20. An online adaptive optimal control problem for a class of nonlinear Markov jump systems (MJSs) is studied in Reference 21.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…An online actor‐critic algorithm is proposed to solve the continuous‐time infinite horizon optimal control problem in Reference 26. A novel algorithm for the nonlinear tracking problem is designed in Reference 20. An online adaptive optimal control problem for a class of nonlinear Markov jump systems (MJSs) is studied in Reference 21.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning is a technique in which the agent interacts with the environment and learns an optimal policy, which avoids the need for system dynamics when designing controllers. Recently, the ideas of reinforcement learning have been used to solve the optimal control problem, [14][15][16][17][18][19][20][21] that is to design optimal controllers for the unknown or partially unknown system. Adaptive dynamic programming (ADP) [22][23][24][25] is a typical technique using the idea of reinforcement learning to solve adaptive optimal control problems.…”
Section: Introductionmentioning
confidence: 99%
“…+is topic is not a challenging one only because this class of time-delayed systems is highly confronted in the industrial process, but mainly due to the restrictions in proving the desired closed-loop performance objectives, in addition to the stability requirement. As needs be, the delicacy identified with the tracking control problem is intensively considered by researchers nowadays [2][3][4][5].…”
Section: Introductionmentioning
confidence: 99%
“…MahmoudZadeh et al [3] present an efficient data collection strategy exploiting a team of unmanned aerial vehicles (UAVs) to monitor and collect the data of a large distributed sensor network usually used for environmental monitoring, meteorology, agriculture and renewable energy applications. Tu et al [4] study a novel scheme for the tracking problem of nonlinear systems, where two reinforcement learning algorithms are proposed to design the optimal control law. Tang el al.…”
mentioning
confidence: 99%