2022
DOI: 10.1007/s12555-020-0809-7
|View full text |Cite
|
Sign up to set email alerts
|

Online Actor-critic Reinforcement Learning Control for Uncertain Surface Vessel Systems with External Disturbances

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 23 publications
(12 citation statements)
references
References 36 publications
0
11
0
1
Order By: Relevance
“…Just last year, Banginwar [ 25 ] offered an initial proposal of such applications (still neglecting nonlinear coupling terms from the transport theorem), where follow–on efforts should incorporate the nonlinear coupling terms. Alternatively, a trajectory tracking control approach for an uncertain surface vessel using the new cascade structure of adaptive reinforcement learning algorithm and kinematic controller, feed-forward term was offered in [ 26 ], while an adaptive reinforcement learning optimal tracking control algorithm was presented in [ 27 , 28 ] for an underactuated surface vessel subject to modeling uncertainties and time-varying external disturbances.…”
Section: Methodsmentioning
confidence: 99%
“…Just last year, Banginwar [ 25 ] offered an initial proposal of such applications (still neglecting nonlinear coupling terms from the transport theorem), where follow–on efforts should incorporate the nonlinear coupling terms. Alternatively, a trajectory tracking control approach for an uncertain surface vessel using the new cascade structure of adaptive reinforcement learning algorithm and kinematic controller, feed-forward term was offered in [ 26 ], while an adaptive reinforcement learning optimal tracking control algorithm was presented in [ 27 , 28 ] for an underactuated surface vessel subject to modeling uncertainties and time-varying external disturbances.…”
Section: Methodsmentioning
confidence: 99%
“…Importantly, we will establish the connection between the trajectory tracking control scheme and the RL-based SV control strategy, ensuring the convergence of learning weights in AC NNs. Before delving into determining the attraction region of the cascade formation control system using the two-layer structure (Figure 1), we will build upon several previous assumptions related to stability and tracking problems, 3,42,50 in addition to the following assumption: (19) for each SV agent is known and bounded by a known positive constant L, as described in the following inequality 0…”
Section: Stability Analysismentioning
confidence: 99%
“…Each SV agent adheres to the assumptions outlined in Assumptions 1 and 2 and meets the bounded conditions during the training process of neural networks, as presented in previous works. 3,50 Furthermore, the signal vectors in each SV agent satisfy the PE condition (39), given by…”
Section: Stability Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Similarly, Ref. [28] addressed a tracking control problem for an uncertain SV using ARL based cascaded structure. Current research on the other hand also employs Actor-Critic network by employing DDPG and PPO.…”
Section: Relevant Studiesmentioning
confidence: 99%