2020
DOI: 10.1101/2020.05.21.108886
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Ramping activity in midbrain dopamine neurons signifies the use of a cognitive map

Abstract: Journeys to novel and familiar destinations employ different navigational strategies. The 1 first drive to a new restaurant relies on map-based planning, but after repeated trips the 2 drive is automatic and guided by local environmental cues 1,2 . Ventral striatal dopamine rises 3 during navigation toward goals and reflects the spatial proximity and value of goals 3 , but the 4 impact of experience, the neural mechanisms, and the functional significance of dopamine 5 ramps are unknown 4,5 . Here, we used fib… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

5
58
2

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 37 publications
(65 citation statements)
references
References 80 publications
5
58
2
Order By: Relevance
“…Animals transition between directed pursuit of rewards and exploratory or quiescent behavioral states on a timescale of minutes to hours (Ferster and Skinner, 1957;Cohen et al, 2007;Flavell et al, 2013;Hills et al, 2015;Stern et al, 2017;Ebitz et al, 2018;Marques et al, 2020). Factors that influence the persistence of reward-seeking behavioral states include current and predicted homeostatic need (Aponte et al, 2011;Chen et al, 2015), reward proximity (Howe et al, 2013;McGinty et al, 2013;Westbrook and Frank, 2018;Guru et al, 2020), the history of action successes and failures (Vroom, 1964;Charnov, 1976;Ullsperger and von Cramon, 2003;Ebitz et al, 2019), opportunity costs (Niv et al, 2007;Kurzban et al, 2013;Boureau et al, 2015), and environmental threats (Lecca et al, 2017;Alhadeff et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Animals transition between directed pursuit of rewards and exploratory or quiescent behavioral states on a timescale of minutes to hours (Ferster and Skinner, 1957;Cohen et al, 2007;Flavell et al, 2013;Hills et al, 2015;Stern et al, 2017;Ebitz et al, 2018;Marques et al, 2020). Factors that influence the persistence of reward-seeking behavioral states include current and predicted homeostatic need (Aponte et al, 2011;Chen et al, 2015), reward proximity (Howe et al, 2013;McGinty et al, 2013;Westbrook and Frank, 2018;Guru et al, 2020), the history of action successes and failures (Vroom, 1964;Charnov, 1976;Ullsperger and von Cramon, 2003;Ebitz et al, 2019), opportunity costs (Niv et al, 2007;Kurzban et al, 2013;Boureau et al, 2015), and environmental threats (Lecca et al, 2017;Alhadeff et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
“…Indeed, there is anatomical and behavioral evidence that supports this idea. Midbrain dopamine (DA) neurons play an essential role in supporting sustained goal-directed behavior (Salamone and Correa, 2012;Dolan and Dayan, 2013;Howe et al, 2013;Guru et al, 2020), and the LHb inhibits DA neural activity via the GABAergic rostromedial tegmental nucleus (RMTg) (Christoph et al, 1986;Ji and Shepard, 2007;Matsumoto and Hikosaka, 2007;Hong et al, 2011). LHb lesions reduce reward omission dips in DA neural activity (Tian and Uchida, 2015), LHb stimulation reduces the number of actions that animals are willing to perform for rewards (Proulx et al, 2018), and LHb stimulation reduces entries to spatial locations where stimulation is delivered (Stamatakis and Stuber, 2012).…”
Section: Introductionmentioning
confidence: 99%
“…In addition to the phasic modulation of DA activity in response to reward or reward-predicting cues, there are an increasing number of reports suggesting that activity may "ramp" as animals approach a reward in space (22,(52)(53)(54)(55)(56) . Most of these studies were based on calcium imaging (22,56) , or on measures of downstream DA release (52,53,55) , but more recently evidence has emerged from in vivo electrophysiology of ramps in the tonic firing of VTA DA neurons during reward approach (54) .…”
Section: Application Of Spike Inference To In Vivo Da Calcium Imagingmentioning
confidence: 99%
“…Successor representation and value update based on it have been suggested to be implemented in the prefrontal/hippocampus-dorsomedial/ventral striatum circuits (Garvert et al , 2017; Russek et al , 2017; Stachenfeld et al , 2017), while circuits including dorsolateral striatum might implement habitual or model-free behavior through “punctate” (i.e., individual) representation of states or actions. Sustained DA response to predictable reward, possibly related to state representation The original experiments that led to the proposal of representation of RPE by DA (Montague et al , 1996; Schultz et al , 1997) have shown that DA response to reward disappears after monkeys repeatedly experienced the stimulus(-action)-reward association and the reward presumably became predictable for them. However, sustained, and often ramping, dopamine signals to/towards (apparently) predictable reward has been widely observed in recent years (Howe et al , 2013; Collins et al , 2016; Hamid et al , 2016; Hamid et al , 2019; Kim et al , 2019; Mohebi et al , 2019; Guru et al , 2020; Sarno et al , 2020). There are a number of possible accounts for such sustained DA signals, positing that they represent RPE (Gershman, 2014; Morita & Kato, 2014; Kato & Morita, 2016; Kim et al , 2019; Mikhael et al , 2019; Song & Lee, 2020) or something different from RPE (Howe et al , 2013; Hamid et al , 2016; Hamid et al , 2019; Mohebi et al , 2019; Guru et al , 2020; Sarno et al , 2020) or both (Lloyd & Dayan, 2015; Collins et al , 2016).…”
Section: Introductionmentioning
confidence: 99%
“…However, sustained, and often ramping, dopamine signals to/towards (apparently) predictable reward has been widely observed in recent years (Howe et al , 2013; Collins et al , 2016; Hamid et al , 2016; Hamid et al , 2019; Kim et al , 2019; Mohebi et al , 2019; Guru et al , 2020; Sarno et al , 2020). There are a number of possible accounts for such sustained DA signals, positing that they represent RPE (Gershman, 2014; Morita & Kato, 2014; Kato & Morita, 2016; Kim et al , 2019; Mikhael et al , 2019; Song & Lee, 2020) or something different from RPE (Howe et al , 2013; Hamid et al , 2016; Hamid et al , 2019; Mohebi et al , 2019; Guru et al , 2020; Sarno et al , 2020) or both (Lloyd & Dayan, 2015; Collins et al , 2016). Of particular interest to our present work, one hypothesis (Gershman, 2014) suggests that sustained (ramping) DA signals might represent sustained RPE generated due to imperfect approximation of value function in the system using representation of states by low-dimensional features.…”
Section: Introductionmentioning
confidence: 99%