To accurately predict rewards associated with states or actions, the variability of observations has to be taken into account. In particular, when observations are noisy, individual rewards should have less influence on the tracking of the average reward, and the estimate of the mean reward should be updated to a smaller extent after each observation. However, it is not known how the magnitude of observation noise might be tracked and used to control prediction updates in the brain's reward system. Here, we introduce a new model that uses simple, tractable learning rules to track the mean and standard deviation of reward, and leverages prediction errors scaled by uncertainty as the central feedback signal. We provide a normative analysis comparing the performance of the new model with that of conventional models in a value-tracking task, and find that the new model has an advantage over conventional models across a range of observation-noise levels. Further, we propose a possible biological implementation of the model in the basal ganglia circuit. The scaled prediction-error feedback signal is consistent with experimental findings on the scaling of dopaminergic prediction errors relative to reward magnitude, and the update rules are consistent with many features of striatal plasticity. Our results span the levels of implementation, algorithm, and computation, and may have important implications for understanding the dopaminergic prediction-error signal and its relation to adaptive and effective learning.
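The idea of uncertainty-scaled updates can be sketched in a few lines of code. This is an illustrative sketch under our own simplifying assumptions, not the paper's exact rules: one delta rule tracks the running mean of reward, a second delta rule tracks the mean absolute prediction error (which is proportional to the standard deviation for Gaussian noise), and the mean is updated using the prediction error divided by the current uncertainty estimate, so noisier observations produce smaller effective updates. The function name `track_reward_stats` and the parameter values are hypothetical.

```python
import random

def track_reward_stats(rewards, alpha=0.1, sigma_min=1e-3):
    """Track running estimates of the mean (v) and spread (s) of reward.

    Sketch only: the feedback signal is the prediction error scaled by
    the current uncertainty estimate, so the effective step size on the
    mean shrinks as the estimated observation noise grows.
    """
    v, s = 0.0, 1.0  # initial estimates of mean and spread
    for r in rewards:
        delta = r - v                        # raw prediction error
        scaled = delta / max(s, sigma_min)   # uncertainty-scaled feedback
        v += alpha * scaled                  # smaller update when s is large
        s += alpha * (abs(delta) - s)        # delta rule toward mean |error|
    return v, s

# Example: noisy rewards drawn around a true mean of 5
random.seed(0)
rewards = [5 + random.gauss(0, 2) for _ in range(4000)]
v, s = track_reward_stats(rewards)
```

After many observations, `v` settles near the true mean and `s` near a value proportional to the noise level; doubling the noise roughly halves the effective learning rate on the mean, which is the qualitative behavior the abstract attributes to the model.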