2020
DOI: 10.48550/arxiv.2012.06662
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Protective Policy Transfer

Abstract: Being able to transfer existing skills to new situations is a key capability when training robots to operate in unpredictable real-world environments. A successful transfer algorithm should not only minimize the number of samples that the robot needs to collect in the new environment, but also prevent the robot from damaging itself or the surrounding environment during the transfer process. In this work, we introduce a policy transfer algorithm for adapting robot motor skills to novel scenarios while minimizin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…We formulate the problem of safe locomotion learning in the context of safe RL. Inspired by prior work [5], [6], [7], Catwalk Two-leg balance Fig. 1: We evaluate our algorithm in legged locomotion tasks: catwalk and two-leg balance.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…We formulate the problem of safe locomotion learning in the context of safe RL. Inspired by prior work [5], [6], [7], Catwalk Two-leg balance Fig. 1: We evaluate our algorithm in legged locomotion tasks: catwalk and two-leg balance.…”
Section: Introductionmentioning
confidence: 99%
“…Different from prior methods that learn a safety critic function which predicts the possibility of safe violations [5], [6], [7], we propose a model-based approach to determine when to switch between the two policies based on the knowledge about the system dynamics. In reality, it is often the case that the designer has some knowledge of the system dynamics at hand.…”
Section: Introductionmentioning
confidence: 99%