2020
DOI: 10.1007/s00773-020-00755-0
|View full text |Cite
|
Sign up to set email alerts
|

Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces

Abstract: This paper presents an automatic collision avoidance algorithm for ships using a deep reinforcement learning (DRL) in continuous action spaces. Obstacle zone by target (OZT) is used to compute an area where a collision will happen in the future based on dynamic information of ships. Agents of DRL detects the approach of multiple ships using a virtual sensor called the grid sensor. Agents learned collision avoidance maneuvering through Imazu problem, which is a scenario set of ship encounter situations. In this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 89 publications
(23 citation statements)
references
References 17 publications
0
23
0
Order By: Relevance
“…To adds up the lengths of own and targeted is a simple way but not practical given the maneuverability of ships. Lyu [38] and Sawada [39] used the area with radius of 0.3 nm as the prohibited zone in their collision avoidance experiments, Sawada also took an effort with 0.5 nm but the performance was not as satisfied.…”
Section: Detection Rangesmentioning
confidence: 99%
“…To adds up the lengths of own and targeted is a simple way but not practical given the maneuverability of ships. Lyu [38] and Sawada [39] used the area with radius of 0.3 nm as the prohibited zone in their collision avoidance experiments, Sawada also took an effort with 0.5 nm but the performance was not as satisfied.…”
Section: Detection Rangesmentioning
confidence: 99%
“…A DRL-based COLREGs-compliant algorithm was proposed for multi-ship collision avoidance [ 29 ]. Sawada, et al [ 30 ] extended the DRL for continuous action spaces using an automatic collision avoidance algorithm. Researchers redesigned the long short-term memory (LSTM) network and trained the model in continuous action spaces.…”
Section: Related Workmentioning
confidence: 99%
“…Xie et al (2020) combined the long short-term memory neural network (LSTM) inverse model-based controller and the model-free A3C policy, to achieve ship collision avoidance under unknown environments. An automatic collision avoidance algorithm was proposed by combining the LSTM and RL in continuous action spaces (Sawada et al, 2021). However, deep learning has, upon the authors' best knowledge, yet been applied for end-to-end adaptive navigation, largely due to the difficulty by the complex and changeable marine environment.…”
Section: Introductionmentioning
confidence: 99%