2021
DOI: 10.48550/arxiv.2102.11319
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Experience Replay (Lin, 1992) was a fundamental idea to reinforcement learning and is still being investigated by many researchers to understand its contributions and propose improvements (Andrychowicz et al, 2017;Pan et al, 2018;Novati and Koumoutsakos, 2019;Zha et al, 2019;Fedus et al, 2020;Wei et al, 2021;Daley et al, 2021). In this work, we present a method that addresses the experience memory and how we can model and explore it to make the agents with ER more efficient in using smaller amounts of data.…”
Section: Literature Review and Related Workmentioning
confidence: 99%
“…Experience Replay (Lin, 1992) was a fundamental idea to reinforcement learning and is still being investigated by many researchers to understand its contributions and propose improvements (Andrychowicz et al, 2017;Pan et al, 2018;Novati and Koumoutsakos, 2019;Zha et al, 2019;Fedus et al, 2020;Wei et al, 2021;Daley et al, 2021). In this work, we present a method that addresses the experience memory and how we can model and explore it to make the agents with ER more efficient in using smaller amounts of data.…”
Section: Literature Review and Related Workmentioning
confidence: 99%