2018
DOI: 10.48550/arxiv.1805.07603
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Episodic Memory Deep Q-Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2
2

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(21 citation statements)
references
References 7 publications
0
21
0
Order By: Relevance
“…Episodic control enables sample-efficiency through explicitly storing the association between returns and state-action pairs in episodic memory [29,4,36]. When combined with Q-learning (habitual control), the episodic memory augments the value function with episodic value estimation, which is shown beneficial to guide the RL agent to latch on good policies during early training [30,46,20].…”
Section: Memory-based Controlsmentioning
confidence: 99%
See 4 more Smart Citations
“…Episodic control enables sample-efficiency through explicitly storing the association between returns and state-action pairs in episodic memory [29,4,36]. When combined with Q-learning (habitual control), the episodic memory augments the value function with episodic value estimation, which is shown beneficial to guide the RL agent to latch on good policies during early training [30,46,20].…”
Section: Memory-based Controlsmentioning
confidence: 99%
“…We note that our memory writing allows multiple updates to multiple neighbor slots, which is unlike the single-slot update rule [4,36,30]. Here, the written value is the Monte Carlo return collected from t + 1 to the end of the episode…”
Section: Memory Operatorsmentioning
confidence: 99%
See 3 more Smart Citations