2020
DOI: 10.48550/arxiv.2006.09939
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations

Abstract: Currently, deep reinforcement learning (RL) shows impressive results in complex gaming and robotic environments. Often these results are achieved at the expense of huge computational costs and require an incredible number of episodes of interaction between the agent and the environment. There are two main approaches to improving the sample efficiency of reinforcement learning methods -using hierarchical methods and expert demonstrations. In this paper, we propose a combination of these approaches that allow th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…Minecraft is a voxel-based 3D video game that, due its popularity and wide variety of mechanics, has attracted a vast amount of RL research. 27,28,[30][31][32][33][34][52][53][54][55][56][57][58][59][60] A large body of work focuses on small, custom-made Minecraft worlds with tasks such as navigation, 53,60 block placing, 54,55 instruction following, 58,59 combat, 56 and others. 28,31,57 Work operating in the massive, randomly generated environments of Minecraft itself has included hill climbing, 52 automated curriculum learning 30 and, most closely related to the RL experiments presented in Sec.…”
Section: Preliminaries and Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Minecraft is a voxel-based 3D video game that, due its popularity and wide variety of mechanics, has attracted a vast amount of RL research. 27,28,[30][31][32][33][34][52][53][54][55][56][57][58][59][60] A large body of work focuses on small, custom-made Minecraft worlds with tasks such as navigation, 53,60 block placing, 54,55 instruction following, 58,59 combat, 56 and others. 28,31,57 Work operating in the massive, randomly generated environments of Minecraft itself has included hill climbing, 52 automated curriculum learning 30 and, most closely related to the RL experiments presented in Sec.…”
Section: Preliminaries and Related Workmentioning
confidence: 99%
“…4.4, diamond mining. 27,[32][33][34] However, to the best of our knowledge, there is no published work that operates in the full, unmodified human action space, which includes drag-and-drop inventory management and item crafting.…”
Section: Preliminaries and Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…While the most well-known frameworks that give rise and exploit hierarchies on sequential decision making problems remain virtually unchanged [3,36,4,28], their use is both expanding and diversifying [40,15,2]. As an example that attests for its power, hierarchies and reinforcement learning, combined with expert demonstrations, set the state of the art on the game of Minecraft [7,20,33,24]. Hierarchical reinforcement learning is also piercing through the multi-agent field [23,14,1].…”
Section: Introductionmentioning
confidence: 99%