2021
DOI: 10.1016/j.conengprac.2021.104807
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive dynamic programming and deep reinforcement learning for the control of an unmanned surface vehicle: Experimental results

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(11 citation statements)
references
References 30 publications
0
11
0
Order By: Relevance
“…However, the resulting optimization problem cannot be solved analytically in the presence of constraints, and thus, numerical methods are needed. RL-based controllers provide alternative solutions to MPC either relying or without relying on models to control the process [26]- [31]. In the model-free setting, an agent interacts with the process by taking the best actions given the current states.…”
Section: A Model Predictive Control (Mpc) and Reinforcement Learning ...mentioning
confidence: 99%
“…However, the resulting optimization problem cannot be solved analytically in the presence of constraints, and thus, numerical methods are needed. RL-based controllers provide alternative solutions to MPC either relying or without relying on models to control the process [26]- [31]. In the model-free setting, an agent interacts with the process by taking the best actions given the current states.…”
Section: A Model Predictive Control (Mpc) and Reinforcement Learning ...mentioning
confidence: 99%
“…Obviously, the initial admissible control policy for nonlinear systems is difficult to obtain, which means that VI has a high application value if the convergence rate of VI can be boosted. ADP has many success examples in areas such as smart home, 20 power grid, 21 and aircraft control 22,23 . Currently, ADP has become a critical class of methods for solving optimal control policies for nonlinear systems.…”
Section: Introductionmentioning
confidence: 99%
“…ADP has many success examples in areas such as smart home, 20 power grid, 21 and aircraft control. 22,23 Currently, ADP has become a critical class of methods for solving optimal control policies for nonlinear systems.…”
Section: Introductionmentioning
confidence: 99%
“…Great effort has been made to developing ADP algorithms for discrete‐time systems, 16,17 as well as continuous‐time systems 18‐20 . In practical applications, ADP algorithms have been used broadly in many fields 21‐28 …”
Section: Introductionmentioning
confidence: 99%