2021
DOI: 10.1016/j.neunet.2021.05.030
|View full text |Cite
|
Sign up to set email alerts
|

The asymmetric learning rates of murine exploratory behavior in sparse reward environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(26 citation statements)
references
References 39 publications
0
25
1
Order By: Relevance
“…(1) The Simple Q-learning model has single state value (hereafter, “SimpleQ”). (2) The Asymmetry model has independent learning rates for positive and negative reward prediction errors (Katahira et al, 2017b; Lefebvre et al, 2017; Ohta et al, 2021). (3) The Perseverance model has a choice auto- correlation to incorporate perseverance in action selection (Katahira, 2018; Lau and Glimcher, 2005).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…(1) The Simple Q-learning model has single state value (hereafter, “SimpleQ”). (2) The Asymmetry model has independent learning rates for positive and negative reward prediction errors (Katahira et al, 2017b; Lefebvre et al, 2017; Ohta et al, 2021). (3) The Perseverance model has a choice auto- correlation to incorporate perseverance in action selection (Katahira, 2018; Lau and Glimcher, 2005).…”
Section: Methodsmentioning
confidence: 99%
“…( 1) The Simple Q-learning model has single state value (hereafter, "SimpleQ"). ( 2) The Asymmetry model has independent learning rates for positive and negative reward prediction errors (Katahira et al, 2017b;Lefebvre et al, 2017;Ohta et al, 2021). ( 3)…”
Section: Computational Modelsmentioning
confidence: 99%
“…In Appendix 5, as an extension of the actor-critic model, we consider the asymmetries in learning that have been considered mainly in action value-based models in the previous literature (Frank et al 2007;Niv et al 2012;Gershman 2015;Lefebvre et al 2017;Ohta et al 2021).…”
Section: Mapping Actor-critic Learning To Q-learningmentioning
confidence: 99%
“…In this Appendix, we consider an extension of the actorcritic model and discuss its statistical properties. Specifically, we consider the asymmetries in learning that have been considered mainly in action value-based models in the previous literature (Frank et al 2007;Niv et al 2012;Gershman 2015;Lefebvre et al 2017;Ohta et al 2021). In actor-critic learning, two types of asymmetry, namely, asymmetries in the critic (state value update) and the actor (policy update), can be considered, although such models have not yet been used for model fitting to behavioral data.…”
Section: Appendix 5 Asymmetric Learning In Actor-critic Learningmentioning
confidence: 99%
“…In animal literature, the Anterior Cingulate Cortex (ACC) has been identified as a major modulator of explore-exploit decisions. Versions of the n-armed bandits have been fitted for rats and mice with the use of n-armed radial mazes (Ohta et al 2021). Anterior Cingulate Cortex (ACC) activation has been linked to foraging in rats in an adapted patch foraging task (Kane et al 2022) and a two-armed bandit monkey lesion study (Kennerley et al 2006).…”
Section: Introductionmentioning
confidence: 99%