Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence 2020
DOI: 10.24963/ijcai.2020/401
|View full text |Cite
|
Sign up to set email alerts
|

Independent Skill Transfer for Deep Reinforcement Learning

Abstract: Recently, diverse primitive skills have been learned by adopting the entropy as intrinsic reward, which further shows that new practical skills can be produced by combining a variety of primitive skills. This is essentially skill transfer, very useful for learning high-level skills but quite challenging due to the low efficiency of transferring primitive skills. In this paper, we propose a novel efficient skill transfer method, where we learn independent skills and only independent components of skills… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 6 publications
0
3
0
Order By: Relevance
“…Thus, the desirability of our approach is that the acquired reward function uncovers both the source dynamics (q φ ) and the dynamics difference (β∆r) across source and target environment. Complementary to our work, several other works also encourage the emergence of a state-covering goal distribution [37,6,27] or enable transfer by introducing the regularization over policies [45,15,46,47,36,48] instead of the adaptation over different dynamics.…”
Section: Related Workmentioning
confidence: 82%
“…Thus, the desirability of our approach is that the acquired reward function uncovers both the source dynamics (q φ ) and the dynamics difference (β∆r) across source and target environment. Complementary to our work, several other works also encourage the emergence of a state-covering goal distribution [37,6,27] or enable transfer by introducing the regularization over policies [45,15,46,47,36,48] instead of the adaptation over different dynamics.…”
Section: Related Workmentioning
confidence: 82%
“…DIAYN [7] created the inferred relationship between options, states, and actions. IST [30] encourages the agent to discover states that are challenging to the policy so that more complex actions can be obtained through options. HIDIO [33] employed continuous options for the first time and compared the effects of different mutual information forms.…”
Section: Representationmentioning
confidence: 99%
“…The limited * Corresponding Author. Email:quanliu@suda.edu.cn number requires them to be empirically explainable [28] and nonredundant [30]. Even as the number of discrete options increases, it is still unrealistic for the option policy to traverse each option value.…”
Section: Introductionmentioning
confidence: 99%