2019
DOI: 10.1109/access.2019.2919249
|View full text |Cite
|
Sign up to set email alerts
|

Optimized Backstepping Design for Ship Course Following Control Based on Actor-Critic Architecture With Input Saturation

Abstract: This paper presents a course following control method for ships based on optimized backstepping (OB) technology. The backstepping technology is employed as the main control framework since the ship course can be modeled in the strict feedback form. Based on the actor-critic architecture and radial basis function (RBF) neural network (NN), the reinforcement learning (RL) strategy is introduced to avoid the difficulty in solving the traditional Hamilton-Jacobi-Bellman (HJB) equation directly. The actor NNs are u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 33 publications
0
8
0
Order By: Relevance
“…Figure 3. Based on the ideal of backstepping method [33], we transform ADRC into series form with the introduction of the virtual control variable, so the mismatched disturbance can be expanded into state by ESO to estimate and compensate. The control system series formation is as shown in Figure 4.…”
Section: A Control Analysismentioning
confidence: 99%
“…Figure 3. Based on the ideal of backstepping method [33], we transform ADRC into series form with the introduction of the virtual control variable, so the mismatched disturbance can be expanded into state by ESO to estimate and compensate. The control system series formation is as shown in Figure 4.…”
Section: A Control Analysismentioning
confidence: 99%
“…Adaptive dynamics programming (ADP) [10][11][12][13][14][15][16][17][18] is a mature and mathematical discipline proposed by Werbos [16][17][18] whereby user-defined cost function as the optimization index to determine optimal control policy with self-learning mechanism. In essence, two neural networks (NNs) are deployed to respectively approximate the cost function and control policy such that the optimality principle and optimal performance index function can be met, simultaneously, such that the Hamilton-Jacobi-Bellman (HJB) equation which does not have an analytical form in the nonlinear system can be solved and curse of dimensionality can also be avoided of the dynamics programming (DP).…”
Section: Introductionmentioning
confidence: 99%
“…By incorporating the Nussbaum-type gain into adaptive backstepping design, a robust controller was proposed for ship heading nonlinear system in the presence of unknown sign of uncertain control coefficients [24]. Later, this approach was extended to ship heading nonlinear system in the presence of uncertain control coefficients and time-varying characters [25], as well as input saturation [26], [27].…”
Section: Introductionmentioning
confidence: 99%
“…A common feature of the aforementioned studies [3]- [18], [22]- [27], [37]- [43] is that the adaptive controllers are designed based on ship steering linear model of Nomoto [44], and extension nonlinear models of Norrbin [45], Bech [46] and Nomoto [47]. The main reason lies in the fact that these models are said to be in SISO strict-feedback form [48].…”
Section: Introductionmentioning
confidence: 99%