A cholinergic feedback circuit to regulate striatal population uncertainty and optimize reinforcement learning

Franklin, Nicholas T.; Frank, Michael J.

doi:10.7554/elife.12029

Cited by 85 publications

(81 citation statements)

References 85 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…While the two functions are not mutually exclusive, our data provide strong support for the second interpretation: On a trial-by-trial basis, the degree of ramping across regions was related to the latency to reward peak elicited by the wave, and the combination of ramp slope and wave magnitude was predictive of subsequent-trial behavioral adjustments. These findings accord with views that dopamine signals can have different functions during reward pursuit and outcome, which can be gated by local microcircuit elements that regulate plasticity windows (Berke, 2018;Bradfield et al, 2013;Franklin and Frank, 2015;Morris et al, 2004;Threlfell and Cragg, 2011) . Moreover, we also interpret transient and localized RPEs during reward pursuit as facilitating inference about the current task state (i.e., determining credit), whereas RPEs during reward itself facilitates reinforcement learning; a dual operation that can also be gated (Franklin and Frank, 2015;Gershman et al, 2015;Redish et al, 2007;Schoenbaum et al, 2013) .…”

Section: Discussionsupporting

confidence: 88%

“…These findings accord with views that dopamine signals can have different functions during reward pursuit and outcome, which can be gated by local microcircuit elements that regulate plasticity windows (Berke, 2018;Bradfield et al, 2013;Franklin and Frank, 2015;Morris et al, 2004;Threlfell and Cragg, 2011) . Moreover, we also interpret transient and localized RPEs during reward pursuit as facilitating inference about the current task state (i.e., determining credit), whereas RPEs during reward itself facilitates reinforcement learning; a dual operation that can also be gated (Franklin and Frank, 2015;Gershman et al, 2015;Redish et al, 2007;Schoenbaum et al, 2013) . Put together, the synthesis of our data and computational simulations imply that dopamine signals are spatio-temporally vectorized during both epochs, tailored to underlying region's computational specialty.…”

Section: Discussionsupporting

confidence: 88%

See 1 more Smart Citation

Dopamine waves as a mechanism for spatiotemporal credit assignment

Hamid

Frank

Moore

2019

Preprint

View full text Add to dashboard Cite

Significant evidence supports the view that dopamine shapes reward-learning by encoding prediction errors. However, it is unknown whether dopamine decision-signals are tailored to the functional specialization of target regions. Here, we report a novel set of wave-like spatiotemporal activity-patterns in dopamine axons across the dorsal striatum. These waves switch between different activational motifs and organize dopamine transients into localized clusters within functionally related striatal subregions. These specific motifs are associated with distinct task contexts: At reward delivery, dopamine signals rapidly resynchronize into propagating waves with opponent directions depending on instrumental task contingencies. Moreover, dopamine dynamics during reward pursuit signal the extent to which mice have instrumental control and interact with reward waves to predict future behavioral adjustments. Our results are consistent with a computational architecture in which striatal dopamine signals are sculpted by inference about instrumental controllability and provide evidence for a spatiotemporally "vectorized" role of dopamine in credit assignment.

show abstract

Section: Discussionsupporting

confidence: 88%

Section: Discussionsupporting

confidence: 88%

Dopamine waves as a mechanism for spatiotemporal credit assignment

Hamid

Frank

Moore

2019

Preprint

View full text Add to dashboard Cite

show abstract

“…In their model, the estimate of reward probability is updated only if a change is detected and if so, a new estimate of reward probability can be made depending on the location of the detected change. Interestingly, a recent modeling study has shown that increased responsiveness to change-points can be instantiated by pauses in tonically active interneurons in the striatum enabling the modulation of learning rate by reward uncertainty (Franklin and Frank, 2015). Although we did not incorporate a change-detection mechanism, such a mechanism would only improve the performance of our model (Gallistel et al, 2001; McGuire et al, 2014).…”

Section: Discussionmentioning

confidence: 99%

Metaplasticity as a Neural Substrate for Adaptive Learning and Choice under Uncertainty

Farashahi

Donahue

Khorsand

et al. 2017

Neuron

101

111

View full text Add to dashboard Cite

Summary Value-based decision making often involves integration of reward outcomes over time, but this becomes considerably more challenging if reward assignments on alternative options are probabilistic and non-stationary. Despite the existence of various models for optimally integrating reward under uncertainty, the underlying neural mechanisms are still unknown. Here we propose that reward-dependent metaplasticity (RDMP) can provide a plausible mechanism for both integration of reward under uncertainty and estimation of uncertainty itself. We show that a model based on RDMP can robustly perform the probabilistic reversal learning task via dynamic adjustment of learning based on reward feedback, while changes in its activity signal unexpected uncertainty. The model predicts time-dependent and choice-specific learning rates which strongly depend on reward history. Key predictions from this model were confirmed with behavioral data from non-human primates. Overall, our results suggest that metaplasticity can provide a neural substrate for adaptive learning and choice under uncertainty.

show abstract

“…This suggests a more complex mechanism in which perseveration is influenced, in part, by the learning rate from negative prediction errors (which can change due to task demand) and by resting levels of DS CHO. Indeed, Franklin et al (2015) showed that a model which takes into account cholinergic activity performs better on a reversal learning task than a model based solely on dopamine prediction error signalling (Franklin & Frank, 2015).…”

Section: Discussionmentioning

confidence: 99%

Regional Striatal Cholinergic Involvement in Human Behavioural Flexibility

Bell

Lindner

Langdon

et al. 2018

Preprint

View full text Add to dashboard Cite

Animal studies have shown that the striatal cholinergic system plays a role in behavioural flexibility but, until recently, this system could not be studied in humans due to a lack of appropriate noninvasive techniques. Using proton magnetic resonance spectroscopy ( 1 H-MRS) we recently showed that the concentration of dorsal striatal choline (an acetylcholine precursor) changes during reversal learning (a measure of behavioural flexibility) in humans. The aim of the present study was to examine whether regional average striatal choline was associated with reversal learning. We measured choline at rest in both the dorsal and ventral striatum using 1 H-MRS and examined its relationship with performance on a probabilistic learning task with a reversal component. Task performance was described using a simple reinforcement learning model that dissociates the contributions of positive and negative prediction errors to learning. Average levels of choline in the dorsal striatum were associated with performance during reversal, but not during initial learning.Specifically, lower levels of choline in the dorsal striatum were associated with a lower number of perseverative trials. Moreover, choline levels explained inter-individual variance in perseveration over and above that explained by learning from negative prediction errors. These findings suggest that the dorsal striatal cholinergic system plays an important role in behavioural flexibility, in line with evidence from the animal literature and our previous work in humans. Additionally, this work provides further support for the idea of measuring choline with 1 H-MRS as a non-invasive way of studying human cholinergic neurochemistry.

show abstract

A cholinergic feedback circuit to regulate striatal population uncertainty and optimize reinforcement learning

Cited by 85 publications

References 85 publications

Dopamine waves as a mechanism for spatiotemporal credit assignment

Dopamine waves as a mechanism for spatiotemporal credit assignment

Metaplasticity as a Neural Substrate for Adaptive Learning and Choice under Uncertainty

Regional Striatal Cholinergic Involvement in Human Behavioural Flexibility

Contact Info

Product

Resources

About