Dual Competition between the Basal Ganglia and the Cortex: from Action-Outcome to Stimulus-Response

Topalidou, Meropi; Kase, Daisuke; Boraud, Thomas; Rougier, Nicolas P.

doi:10.1101/187294

Cited by 2 publications

(2 citation statements)

References 86 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These models propose that novel behaviors are first acquired via a dopamine-dependent plasticity mechanism within the basal ganglia, and that with consistent performance, control of behavior is transferred to cortex via a Hebbian cortico–cortical plasticity mechanism. This idea has been applied to categorization learning and instrumental conditioning (Ashby et al, 2007), sequence learning (Hélie, Roeder, Vucovich, Rünger, & Ashby, 2015) and to action selection in probabilistic environments (Topalidou, Kase, Boraud, & Rougier, 2017), and it has been suggested to describe how basal-ganglia-dependent behavior becomes automatic in general (Hélie, Ell, & Ashby, 2015). Though the cortical module of these models is conceptually similar to our habitual controller, the basal ganglia module is different from our goal-directed controller in important ways: its learning rule instantiates a version of model-free RL, which tends to repeat actions in situations where they have led to reinforcement in the past, but does not learn about the particular outcomes that are expected to follow each action.…”

Section: Discussionmentioning

confidence: 99%

Habits without values.

Miller

Shenhav

Ludvig

2019

Psychological Review

224

185

View full text Add to dashboard Cite

Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as arising from model-free reinforcement learning mechanisms, which typically select between available actions based on the future value expected to result from each. Traditionally, however, habits have been understood as behaviors that can be triggered directly by a stimulus, without requiring the animal to evaluate expected outcomes. Here, we develop a computational model instantiating this traditional view, in which habits develop through the direct strengthening of recently taken actions rather than through the encoding of outcomes. We demonstrate that this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the effects of reinforcement schedule on the rate of habit formation. The model also explains the prevalent observation of perseveration in repeated-choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors and help to better guide research into the neural mechanisms underlying control of instrumental behavior more generally.

show abstract

Section: Discussionmentioning

confidence: 99%

Habits without values.

Miller

Shenhav

Ludvig

2019

Psychological Review

224

185

View full text Add to dashboard Cite

show abstract

“…These models propose that novel behaviors are first acquired via a dopamine-dependent plasticity mechanism within the basal ganglia, and that with consistent performance, control of behavior is transferred to cortex via a Hebbian cortico-cortical plasticity mechanism. Developed first in the context of categorization learning (Ashby, Ennis, & Spiering, 2007) , this idea has recently been applied to sequence learning (Hélie, Roeder, Vucovich, Rünger, & Ashby, 2015) and to action selection in probabilistic environments (Topalidou, Kase, Boraud, & Rougier, 2017) , and it has been suggested to describe how basal-ganglia-dependent behavior becomes automatic in general (Hélie, Ell, & Ashby, 2015) . Though the cortical module of these models is conceptually similar to our habitual controller, the basal ganglia module is different from our goal-directed controller in important ways: its learning rule instantiates a version of model-free RL, which tends to repeat actions in situations where they have led to reinforcement in the past, but does not learn about the particular outcomes that are expected to follow each action.…”

Section: Relationship To Previous Computational Modelsmentioning

confidence: 99%

Habits without Values

Miller

Shenhav

Ludvig

2016

Preprint

115

View full text Add to dashboard Cite

Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as behaviors arising from model-free reinforcement learning (RL) mechanisms, which typically represent the expected value associated with the possible outcomes of each action before one of those actions is chosen. Traditionally, however, habits have been understood as arising from mechanisms that are independent of outcomes. Here, we develop a computational model instantiating this traditional view, in which habits are acquired through the direct strengthening of recently taken actions, independent of outcome. We demonstrate how this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the dependence of formation rates on the reinforcement schedule. The model also explains the prevalent observation of perseveration in repeated choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors, and help to better guide research into the neural mechanisms underlying control of instrumental behaviors more generally.

show abstract

Dual Competition between the Basal Ganglia and the Cortex: from Action-Outcome to Stimulus-Response

Cited by 2 publications

References 86 publications

Habits without values.

Habits without values.

Habits without Values

Contact Info

Product

Resources

About