A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning

Amo, Ryunosuke; Matias, Sara; Yamanaka, Akihiro; Tanaka, Kenji F.; Uchida, Naoshige; Watabe-Uchida, Mitsuko

doi:10.1038/s41593-022-01109-2

Cited by 56 publications

(78 citation statements)

References 56 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the future, examining the pharmacological manipulation of dopamine D1 receptors is an important step for better circuit-level understanding of the neuronal mechanism of the reward prediction and the reward prediction error. Dopamine neurons in VTA show phasic activity to unexpected reward presentations, but phasic activity to the reward decreases as learning progresses, and the neurons show phasic activity to reward-predictive cues (Amo et al, 2022;Schultz et al, 1997). Thus, in our experiments, increases in pupil size may reflect reward prediction errors in the presentation of the auditory stimulus.…”

Section: Resultsmentioning

confidence: 59%

“…However, the neuronal activities of LC neurons in the study of Bouret and Sara (2003) were examined with the reversal of the contingency between the stimulus and the outcome or the re-acquisition after the extinction in the Go/No-Go task. Although the phasic activities to unpredicted reward found in LC neurons (Bouret and Sara, 2004) may be slightly different from those neurons found in dopaminergic neurons found in VTA (Amo et al, 2022;Schultz et al, 1997), both phasic activities of LC and DA neurons are known to show phasic responses to unpredictable events. Moreover, LC neurons show phasic activity in response to a novel stimulus and decreased activity when the stimulus ceases to predict biologically important events (Berridge and Waterhouse, 2003;Vankov et al, 1995).…”

Section: Resultsmentioning

confidence: 66%

“…LC neurons show a burst of activity when the stimuli that predict biologically important events, such as reward and aversive events, are presented (Aston-Jones ad Bloom, 1981;Aston-Jones et al, 1997, 2005Bouret and Richmond, 2015;Bouret and Sara, 2004). LC neurons also show similar activities to dopaminergic neurons in the ventral tegmental area (VTA), such as increased phasic activity in response to unpredicted reward and decreased activity through repeated experience and transfer to a reward-predicting stimulus (Amo et al, 2022;Bouret and Sara, 2004;Schultz et al, 1997). However, the neuronal activities of LC neurons in the study of Bouret and Sara (2003) were examined with the reversal of the contingency between the stimulus and the outcome or the re-acquisition after the extinction in the Go/No-Go task.…”

Section: Resultsmentioning

confidence: 99%

“…Dopamine neurons in VTA show phasic activity to unexpected reward presentations, but phasic activity to the reward decreases as learning progresses, and the neurons show phasic activity to reward-predictive cues (Amo et al, 2022; Schultz et al, 1997). Thus, in our experiments, increases in pupil size may reflect reward prediction errors in the presentation of the auditory stimulus.…”

Section: Discussionmentioning

confidence: 99%

See 3 more Smart Citations

Pupillary Dynamics of Mice Performing a Pavlovian Delay Conditioning Task Reflect Reward-Predictive Signals

Yamada

Toda

2022

Preprint

View full text Add to dashboard Cite

Pupils can signify various internal processes and states, such as attention, arousal, and working memory. Changes in pupil size are reportedly associated with learning speed, prediction of future events, and deviation from prediction in human studies. However, the detailed relationship between pupil size change and prediction is unclear. We explored the dynamics of the pupil size in mice performing a Pavlovian delay conditioning task. The head-fixed experimental setup combined with deep learning-based image analysis enabled us to reduce spontaneous locomotor activity and to track the precise dynamics of the pupil size of behaving mice. By manipulating the predictability of the reward in the Pavlovian delay conditioning task, we demonstrated that the pupil size of mice is modulated by reward prediction and consumption, as well as body movements, but not by the unpredicted reward delivery. Furthermore, we clarified that the pupil size is still modulated by reward prediction, even after the disruption of body movements by intraperitoneal injection of haloperidol, a dopamine D2 receptor antagonist. These results suggest that the changes in the pupil size reflect the reward prediction signals and do not reflect reward prediction error signals, thus we provide important evidence to reconsider the neuronal circuit computing the reward prediction error. This integrative approach of behavioral analysis, image analysis, pupillometry, and pharmacological manipulation will pave the way for understanding the psychological and neurobiological mechanisms of reward prediction and the prediction errors essential to learning and behavior.

show abstract

Section: Resultsmentioning

confidence: 59%

Section: Resultsmentioning

confidence: 66%

Section: Resultsmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

See 2 more Smart Citations

Pupillary Dynamics of Mice Performing a Pavlovian Delay Conditioning Task Reflect Reward-Predictive Signals

Yamada

Toda

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…The state model has a learningrelated temporal shift which leads to rapidly diminished cue-onset prediction errors in the 15 and 30s conditions at 20 trials of training (Figure 5C), in contrast to the data. This occurs due to a moving prediction error that moves backward in time from shock to cue onset over trials (Supplementary Figure 4A-B), a phenomenon which has been recently observed in dopamine neuron responses to appetitive learning (Amo et al, 2022). Such a prediction error can only "arrive" at the cue onset quickly if the distance between cue onset and shock is short.…”

Section: Time Uncertainty Model Matches Learning Effects On Norepinep...mentioning

confidence: 69%

Prefrontal norepinephrine represents a threat prediction error under uncertainty

Basu

Yang

et al. 2022

Preprint

View full text Add to dashboard Cite

Animals must learn to predict constantly varying threats in the environment to survive by enacting defensive behaviors. Dopamine is involved in the prediction of rewards, encoding a reward prediction error in a similar manner to temporal difference learning algorithm. However, the corresponding molecular and computational form of threat prediction errors is not as well-characterized, although norepinephrine and other neuromodulators and neuropeptides participate in fear learning. Here, we utilized fluorescent norepinephrine recordings over the course of fear learning in concert with reinforcement learning modeling to identify its role in the prediction of threat. By varying timing and sensory uncertainty in the formation of threat associations, we were able to define a precise computational role for norepinephrine in this process. Norepinephrine release approximates the strength of fear associations, and its temporal dynamics are compatible with a prediction error signal. Intriguingly, the release of norepinephrine is influenced by time and sensory feedback, serving as an antithesis of the classical reward prediction error role of dopamine. Thus, these results directly demonstrate a combined cognitive and affective role of norepinephrine in the prediction of threat, with implications for neuropsychiatric disorders such as anxiety and PTSD.

show abstract

Impacts of dopamine on learning and behavior in health and disease: Insights from optogenetics in rodents

Campbell,

Green,

Romero Pinto

et al. 2025

Encyclopedia of the Human Brain

View full text Add to dashboard Cite

A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning

Cited by 56 publications

References 56 publications

Pupillary Dynamics of Mice Performing a Pavlovian Delay Conditioning Task Reflect Reward-Predictive Signals

Pupillary Dynamics of Mice Performing a Pavlovian Delay Conditioning Task Reflect Reward-Predictive Signals

Prefrontal norepinephrine represents a threat prediction error under uncertainty

Impacts of dopamine on learning and behavior in health and disease: Insights from optogenetics in rodents

Contact Info

Product

Resources

About