2022
DOI: 10.1002/hbm.25988
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T

Abstract: The model-free algorithms of "reinforcement learning" (RL) have gained clout across disciplines, but so too have model-based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single rewardprediction error (RPE), but the scope of learning goes beyond the experienced state and action. I… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
39
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(39 citation statements)
references
References 333 publications
(709 reference statements)
0
39
0
Order By: Relevance
“…The first question relates to robustness over time. Given the tendency for learning-related configurations in the human brain to vary more across rather than within individuals 50 , we employed a between-groups analytic approach inspired by an increasing body of work that uses behavioural profiles to cluster groups of individuals to increase robustness and reliability of hypothesis-specific brain activity 34,51,52 . While the present DDM parsimoniously distinguished human participants based on strategy-specific modulations of drift rate, parameters were necessarily static.…”
Section: Discussionmentioning
confidence: 99%
“…The first question relates to robustness over time. Given the tendency for learning-related configurations in the human brain to vary more across rather than within individuals 50 , we employed a between-groups analytic approach inspired by an increasing body of work that uses behavioural profiles to cluster groups of individuals to increase robustness and reliability of hypothesis-specific brain activity 34,51,52 . While the present DDM parsimoniously distinguished human participants based on strategy-specific modulations of drift rate, parameters were necessarily static.…”
Section: Discussionmentioning
confidence: 99%
“…An additional key outstanding question relates to the robustness of heuristic adoption over time. Given the tendency for learning-related configurations in the human brain to vary more across rather than within individuals 40 , we employed a between-groups analytic approach inspired by an increasing body of work that uses behavioural profiles to cluster groups of individuals to increase robustness and reliability of hypothesis-specific brain activity 24, 41, 53 . While the present DDM parsimoniously distinguished human participants based on strategy-specific modulations of drift-rate, parameters were necessarily static.…”
Section: Discussionmentioning
confidence: 99%
“…The RL framework has appreciable predictive validity [4,5] when accounting for human choices and learning behavior in a variety of settings [6][7][8]-let alone the power of extensions of RL [9][10][11][12]. However, such models sometimes fail to account well for an individual's behavior even in a relatively simple task that should be amenable to RL in principle [13].…”
Section: Introductionmentioning
confidence: 99%
“…Hysteresis is bidirectional as well and adds dynamics in the form of either repetition or alternation of previous actions, which may also manifest for a horizon beyond just the most recent action [17][18][19][20]. Despite at least some precedent for either action bias or action hysteresis (more so the latter), the combination of both bias and hysteresis has even less precedent for RL [12,21].…”
Section: Introductionmentioning
confidence: 99%