2021
DOI: 10.1101/2021.04.27.441454
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Optimism and pessimism in optimised replay

Abstract: The replay of task-relevant trajectories is known to contribute to memory consolidation and improved task performance. A wide variety of experimental data show that the content of replayed sequences is highly specific and can be modulated by reward as well as other prominent task variables. However, the rules governing the choice of sequences to be replayed still remain poorly understood. One recent theoretical suggestion is that the prioritization of replay experiences in decision-making problems is based on … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(14 citation statements)
references
References 72 publications
(178 reference statements)
1
13
0
Order By: Relevance
“…These results are in line with previous reports suggesting a role for replay when planning (Kurth-Nelson et al, 2016) and learning (Schuck & Niv, 2019) trajectories through an abstract state space in a RL problem and evidence from recordings in animals suggesting that replay can explore novel trajectories (Gupta et al, 2010;Ólafsdóttir et al, 2015). Our findings are in line with notions that generative replay provides a mechanism for efficiently learning and sampling from a generative model of the world (Foster, 2017), in line with a crucial role of replay in planning and RL (Antonov et al, 2021;Mattar & Daw, 2018) but also structure learning (Evans & Burgess, 2020). In the present study, we provide evidence for replay as a mechanism for testing hypotheses underlying compositional understanding.…”
Section: Discussionsupporting
confidence: 93%
“…These results are in line with previous reports suggesting a role for replay when planning (Kurth-Nelson et al, 2016) and learning (Schuck & Niv, 2019) trajectories through an abstract state space in a RL problem and evidence from recordings in animals suggesting that replay can explore novel trajectories (Gupta et al, 2010;Ólafsdóttir et al, 2015). Our findings are in line with notions that generative replay provides a mechanism for efficiently learning and sampling from a generative model of the world (Foster, 2017), in line with a crucial role of replay in planning and RL (Antonov et al, 2021;Mattar & Daw, 2018) but also structure learning (Evans & Burgess, 2020). In the present study, we provide evidence for replay as a mechanism for testing hypotheses underlying compositional understanding.…”
Section: Discussionsupporting
confidence: 93%
“…Also, as we will show below the default mode generates shortcuts replays as those found by Gupta et al (2010), whereas the reverse mode does not. Second, the recent discovery of so-called pessimistic replay, i.e., replay of experiences which are not optimal from a reinforcement learning perspective, show that suboptimal replay sequences occur in human brains (Eldar et al, 2020) for a good reason (Antonov et al, 2022).…”
Section: Resultsmentioning
confidence: 99%
“…the replay of sub-optimal experiences, in humans (Eldar et al, 2020) supports this. Interestingly, follow-up modeling work (Antonov et al, 2022) suggests that optimal replay occurs predominantly during initial trials and then switches to pessimistic replay. However, this switch is dependent on the utility of experiences (Mattar and Daw, 2018).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…More recent work has proposed a richer integration between MF and MB values (e.g., Keramati et al, 2016) as well as following an earlier influential suggestion as to how an MB system might train an MF system (Mattar and Daw, 2018;Sutton, 1991). More recent evidence has raised the possibility that this may occur during rest periods (Antonov et al, 2021;Liu et al, 2021a).…”
Section: Introductionmentioning
confidence: 95%