2021
DOI: 10.1155/2021/6649625
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control

Abstract: Autonomous underwater vehicles (AUVs) are widely used to accomplish various missions in the complex marine environment; the design of a control system for AUVs is particularly difficult due to the high nonlinearity, variations in hydrodynamic coefficients, and external force from ocean currents. In this paper, we propose a controller based on deep reinforcement learning (DRL) in a simulation environment for studying the control performance of the vectored thruster AUV. RL is an important method of artificial i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 94 publications
0
8
0
Order By: Relevance
“…The dynamic control of the robot adopts the deep RL method [9]. First, the task needs to be modeled as a Markov decision process which means that the state at the current time is only related to the state and action at the previous time.…”
Section: Dynamic Control Designmentioning
confidence: 99%
“…The dynamic control of the robot adopts the deep RL method [9]. First, the task needs to be modeled as a Markov decision process which means that the state at the current time is only related to the state and action at the previous time.…”
Section: Dynamic Control Designmentioning
confidence: 99%
“…By using the RL-based control [23], the desired dynamic control laws of the vehicle, regardless of the dynamic model, can be obtained, where the interaction between the vehicle and the environment is performed. This control process can be described as a Markov decision process, where the AUV performs actions under the current states and behaviour policy.…”
Section: Dynamic Control Designmentioning
confidence: 99%
“…In the TD3 network design, the structure of the actor network and critic network is shown in Fig. 5, which has fewer nodes than the previous work [23]. The inputs of the actor network are states and the outputs are actions.…”
Section: A =mentioning
confidence: 99%
“…Considering different factors which actually affect the control accuracy of AUV navigation control, ref. [133] developed a reward function for deep RL controller. The designed reward function can effectively improve reliability and stability, reduce energy consumption, and restrain the vectored thruster sudden change.…”
Section: Intelligent Control (1) Fuzzy Controlmentioning
confidence: 99%