2021
DOI: 10.1109/lra.2021.3097660
|View full text |Cite
|
Sign up to set email alerts
|

Towards Safe Control of Continuum Manipulator Using Shielded Multiagent Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(11 citation statements)
references
References 22 publications
0
11
0
Order By: Relevance
“…1. Multi-agent Deep Q-Network (MADQN) [40]: This is the multi-agent version of Deep Q-Network (DQN) [25], which is an off-policy RL method by applying a deep neural network to approximate the value function and an experience replay buffer to break the correlations between samples to stabilize the training.…”
Section: Comparison With the State-of-the-art Benchmarksmentioning
confidence: 99%
“…1. Multi-agent Deep Q-Network (MADQN) [40]: This is the multi-agent version of Deep Q-Network (DQN) [25], which is an off-policy RL method by applying a deep neural network to approximate the value function and an experience replay buffer to break the correlations between samples to stabilize the training.…”
Section: Comparison With the State-of-the-art Benchmarksmentioning
confidence: 99%
“…They range from using continuum robot arms with one segment ( Chattopadhyay et al, 2018 ; Satheeshbabu et al, 2019 , 2020 ) to two ( Yang et al, 2019 ), three ( Zhang et al, 2017 ), and four ( You et al, 2017 ) segments. There are also other studies that use multi-agent reinforcement learning in which each actuator of a multi-degree-of-freedom arm is considered as one agent ( Ansari et al, 2018 ; Perrusquía et al, 2020 ; Ji et al, 2021 ). Furthermore, there are studies that use reinforcement learning for the reaching component of the hierarchical control of tasks involving interactions with the environment of a continuum robot arm ( Jiang et al, 2021 ).…”
Section: Related Workmentioning
confidence: 99%
“…For continuum robots (15) with disturbance d(t), which is bounded and whose derivative exists, the dynamic model can be expressed as B q (q)q + C q (q, q) q + K q q + D q (q) q = τ − d(t). (21) Then a nominal SMC controller can be designed as…”
Section: A Tailored Sliding Mode Control Schemementioning
confidence: 99%
“…Moreover, when considering the uncertainties of the continuum robots, including both the intervention from the environment and the parameter perturbation of the system, it is hard to achieve high precision control for continuum robots in a complex uncertain environment. To this end, G. Ji et al presented the superiority of controlling a cable-driven continuum robot using a multi-agent DQN with shielding [21]. Although their work considered the external payload, soft and hard interference, they did not consider the internal uncertainties.…”
Section: Introductionmentioning
confidence: 99%