2020
DOI: 10.1109/access.2020.3012204
|View full text |Cite
|
Sign up to set email alerts
|

Batch Prioritization in Multigoal Reinforcement Learning

Abstract: In multigoal reinforcement learning, an agent interacts with an environment and learns to achieve multiple goals. The goal-conditioned policy is trained to effectively generalize its behavior for multiple goals. During training, the experiences collected by the agent are randomly sampled from a replay buffer. Because biased sampling of achieved goals affects the success rate of a given task, it should be avoided by considering the valid goal space, introduced here as the set of goals to achieve, and the curren… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(3 citation statements)
references
References 17 publications
(22 reference statements)
0
3
0
Order By: Relevance
“…The three tasks are Push, PickAndPlace, and Slide tasks. The integration of the proposed sampling strategy into HER is based on the implementation in (Vecchietti et al 2020).…”
Section: Resultsmentioning
confidence: 99%
“…The three tasks are Push, PickAndPlace, and Slide tasks. The integration of the proposed sampling strategy into HER is based on the implementation in (Vecchietti et al 2020).…”
Section: Resultsmentioning
confidence: 99%
“…In practice, older experiences often originate from unsuccessful episodes, whereas recent experiences are often from successful episodes and thus have more useful information. For instance, unsuccessful episodes in the initial stage mostly contain environment steps with inappropriate action choices [21] that do not provide useful information. However, an RL agent should not aggressively utilize newer experiences and ignore the older ones, which causes the agent's failure to generalize the knowledge and to experience a catastrophic forgetting [22].…”
Section: Introductionmentioning
confidence: 99%
“…Recently, deep reinforcement learning (DRL) has been widely applied to deterministic games ( Silver et al, 2018 ), video games ( Mnih et al, 2015 ; Mnih et al, 2016 ; Silver et al, 2016 ), sensor networks ( Kim et al, 2020 ), and complex robotic tasks ( Andrychowicz et al, 2017 ; Hwangbo et al, 2019 ; Seo et al, 2019 ; Vecchietti et al, 2020 ; Vecchietti, Seo & Har, 2020 ). Despite the breakthrough results achieved in the field of DRL, deep learning in multi-agent environments that require both cooperation and competition is still challenging.…”
Section: Introductionmentioning
confidence: 99%