2018
DOI: 10.1016/j.robot.2018.05.016
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning

Abstract: low-level control of autonomous underwater vehicles using deep reinforcement learning, Robotics and Autonomous Systems (2018),

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
56
0
2

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 142 publications
(58 citation statements)
references
References 53 publications
0
56
0
2
Order By: Relevance
“…The reward function is one of the major ways the designer can influence and direct the behaviour of the agent. One of the more popular alternatives to L 1 norm and clipping to achieve saturated rewards are the class of exponential reward functions, and notably the Gaussian reward function as in [37]. Analyzing different choices of the reward function was not given much focus as the original choice gave satisfying results.…”
Section: A Key Factors Impacting Trainingmentioning
confidence: 99%
“…The reward function is one of the major ways the designer can influence and direct the behaviour of the agent. One of the more popular alternatives to L 1 norm and clipping to achieve saturated rewards are the class of exponential reward functions, and notably the Gaussian reward function as in [37]. Analyzing different choices of the reward function was not given much focus as the original choice gave satisfying results.…”
Section: A Key Factors Impacting Trainingmentioning
confidence: 99%
“…Waterborne transport vehicles are divided into surface water ships and underwater ships. For underwater ships, relevant scholars use DRL based on actor-critic to achieve low-level control of autonomous underwater vehicles (AUV) [231]. For example, two neural networks are used to construct DRL [232], then actor-critic [233] or deep deterministic policy gradient (DDPG) is introduced to self-modify in subsequent research [234].…”
Section: Artificial Intelligencementioning
confidence: 99%
“…The goal in reinforcement learning is to learn a policy that maximizes the expected return J from the start distribution [9]:…”
Section: Deep Reinforcement Learning Theorymentioning
confidence: 99%
“…His method was easier to implement compared with other RL methods, but at the same time, simulated results showed a poor speed of convergence towards the minimal solution [8]. Carlucho et al developed a deep RL framework based on the Actor-Critic theory, which took the available raw sensory information as input and output the continuous control actions as low-level control commands of AUV [9]. In another article [10], an expert agent-based system, based on a reinforcement learning agent, was proposed for self-adapting multiple low-level PID controllers in mobile robots.…”
Section: Introductionmentioning
confidence: 99%