2022
DOI: 10.1016/j.oceaneng.2022.112424
|View full text |Cite
|
Sign up to set email alerts
|

A general motion control architecture for an autonomous underwater vehicle with actuator faults and unknown disturbances through deep reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…Deep learning and reinforcement learning have been applied in the motion control of autonomous underwater vehicles (AUVs) under conditions of actuator failure and unknown disturbances [30], [31]. To adapt to rapidly changing environments, a neural network has been used to accurately predict pitch angles [32], while a model-free RL-based controller has demonstrated potential in addressing timevarying dynamics [33].…”
Section: Related Workmentioning
confidence: 99%
“…Deep learning and reinforcement learning have been applied in the motion control of autonomous underwater vehicles (AUVs) under conditions of actuator failure and unknown disturbances [30], [31]. To adapt to rapidly changing environments, a neural network has been used to accurately predict pitch angles [32], while a model-free RL-based controller has demonstrated potential in addressing timevarying dynamics [33].…”
Section: Related Workmentioning
confidence: 99%
“…Decision making for AMR has become a well-studied problem over the years [6], but safe decision making under uncertainties remains an open challenge. Many recent approaches use learning enabled components, such as deep neural networks (DNN) [4] and deep reinforcement learning (DRL) [5], to make quick decisions with a reasonable level of accuracy for many applications [1]. However, a vast majority of these techniques do not consider uncertainties and return only one decision, which might be incorrect in the presence of measurement or process noise at runtime [7].…”
Section: Related Workmentioning
confidence: 99%
“…Furthermore, these different failures can often look the same to a human observer and can even cause confusion in state estimation-based [2] or bias measurement [3] approaches that have been proposed to deal with failures. Learning-based approaches that deal with such problems [4], [5], can encode more complex interactions in measurements, and can make better decisions, but even in such critical applications, these only return one decision and do not account for uncertainties. Thus, here we claim that if the robot could assess uncertainties by evaluating other Rahul Peddi and Nicola Bezzo are with the Departments of Systems and Information Engineering and Electrical and Computer Engineering, University of Virginia, Charlottesville, VA 22904, USA.…”
Section: Introductionmentioning
confidence: 99%
“…For the fast convergence of adaption in working environmental disturbances, Wadi et al devised a conditional adaptation law to tune the gains of kinematic and dynamic controllers in two ways: adaptive proportional control and universal adaptive stabilization-based control [21]. Moreover, intelligent methods, including fuzzy control [22], model predictive control [23], and reinforcement learning-based control [24], are combined with other control methods to improve the performance of trajectory tracking, taking into account complicated models, environments and system constraints.…”
Section: Introductionmentioning
confidence: 99%