2024
DOI: 10.3389/fnbeh.2023.1302842

A reinforcement learning model with choice traces for a progressive ratio schedule

Keiko Ihara,
Yu Shikano,
Sae Kato
et al.

Abstract: The progressive ratio (PR) lever-press task serves as a benchmark for assessing goal-oriented motivation. However, a well-recognized limitation of the PR task is that only a single data point, known as the breakpoint, is obtained from an entire session as a barometer of motivation. Because the breakpoint is defined as the final ratio of responses achieved in a PR session, variations in choice behavior during the PR task cannot be captured. We addressed this limitation by constructing four reinforcement learnin…

Cited by 2 publications (3 citation statements)
References 68 publications
“…Perseveration and action repetition in this context have been related to the functions of dopamine [20,144,176–181] (but see [101,182]) as well as perhaps serotonin [177,183] (but see [101]). The theory here can take into account the roles of dopaminergic systems for not only computations such as the reward-prediction error [184–186] but also motivation, vigor, effort, and skillful execution of movement [187–192].…”
Section: Bidirectional Hysteretic Bias (mentioning)
confidence: 99%
“…Like H_t(a), its counterpart H_t(s_t, a) can also be modeled with the accumulating hysteresis trace [21]. Along with the alternative of a replacing trace (see Methods), another more constrained implementation of hysteretic accumulation could be based on an action-prediction error (or choice-prediction error) with analogy to the reward-prediction error [40,42–47,96,143,144,178,181]. The action-prediction error has been framed as "value-free", but this label and that of H_t(s_t, a) as "habit strength" (cf.…”
Section: PLOS Computational Biology (mentioning)
confidence: 99%
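
The three trace variants named in the statement above (accumulating, replacing, and action-prediction-error based) can be contrasted in a minimal sketch. This is not taken from the cited papers: the function names, the `decay` and `alpha` parameters, and the exact update forms are illustrative assumptions modeled on the textbook analogy with eligibility traces and with the reward-prediction error.

```python
# Illustrative sketch only: names, parameters, and update forms are assumptions,
# not the equations of the cited papers.

def accumulating_trace(h, action, decay=0.9):
    """Decay every entry, then add 1 to the chosen action (unbounded above 1)."""
    out = [decay * x for x in h]
    out[action] += 1.0
    return out

def replacing_trace(h, action, decay=0.9):
    """Decay every entry, then reset the chosen action to exactly 1."""
    out = [decay * x for x in h]
    out[action] = 1.0
    return out

def action_prediction_error_trace(h, action, alpha=0.1):
    """Move each entry toward 1 (chosen) or 0 (unchosen) by a delta rule.

    The term (target - x) plays the role of an action-prediction error,
    by analogy with the reward-prediction error in value learning.
    """
    return [
        x + alpha * ((1.0 if i == action else 0.0) - x)
        for i, x in enumerate(h)
    ]

if __name__ == "__main__":
    h_acc = h_rep = h_ape = [0.0, 0.0]
    for a in [0, 0, 1]:  # a short hypothetical choice sequence
        h_acc = accumulating_trace(h_acc, a)
        h_rep = replacing_trace(h_rep, a)
        h_ape = action_prediction_error_trace(h_ape, a)
    print(h_acc, h_rep, h_ape)
```

Under these assumptions, repeated choices of one action push the accumulating trace above 1, hold the replacing trace at 1, and move the delta-rule trace toward 1 without reaching it, which is one sense in which the last form is more constrained.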