2021
DOI: 10.1109/lra.2021.3070252
|View full text |Cite
|
Sign up to set email alerts
|

Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
65
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 127 publications
(65 citation statements)
references
References 12 publications
0
65
0
Order By: Relevance
“…RL with safe constraints [72] has emerged to consider applications in the real world. Some studies have refined the constraints or rewards for safe ranges in states, which can help in avoidance of unstable areas.…”
Section: Reinforcement Learning For Diverse Fieldsmentioning
confidence: 99%
See 1 more Smart Citation
“…RL with safe constraints [72] has emerged to consider applications in the real world. Some studies have refined the constraints or rewards for safe ranges in states, which can help in avoidance of unstable areas.…”
Section: Reinforcement Learning For Diverse Fieldsmentioning
confidence: 99%
“…In this study, actor-critic methods consisting of deep learning with a safe actor using features of hypo and hyperglycemia were adopted from the perspectives of reinforcement learning and switching controllers for safety. Actorcritic methods have been widely implemented [39], [70], [72]- [74] in many RL applications.…”
Section: E Reinforcement Learning For Regulation Of Glucosementioning
confidence: 99%
“…For the suturing task, including the works related to knot tying and needle insertion, we reported the following: [16], [70], [85], [88], [89], [161], [178], [195]- [197], [203], [243], and [246]. The pick, transfer, and place task was mainly characterized by experiments relying on pegs and rings from the fundamentals of the laparoscopic surgery training paradigm [54], [69], [91], [96], [102], [151]- [153], [163], [264] or new surgical tools [177]. A lot of the remaining works focused on tissue interaction.…”
Section: Instrument Controlmentioning
confidence: 99%
“…In contrast to end-to-end robot policy learning [3,20,21,22], a popular approach for generating behaviors is to parameterize motions by the outputs of a perception network [4,5,10,17,23]. This decouples perception from planning and control, and enables perception systems to be trained in simulation without the need for accurate physical simulation.…”
Section: Parameterized Representations For Manipulationmentioning
confidence: 99%