2020
DOI: 10.3389/fnbeh.2019.00270
|View full text |Cite
|
Sign up to set email alerts
|

Separating Probability and Reversal Learning in a Novel Probabilistic Reversal Learning Task for Mice

Abstract: The exploration/exploitation tradeoff-pursuing a known reward vs. sampling from lesser known options in the hope of finding a better payoff-is a fundamental aspect of learning and decision making. In humans, this has been studied using multi-armed bandit tasks. The same processes have also been studied using simplified probabilistic reversal learning (PRL) tasks with binary choices. Our investigations suggest that protocols previously used to explore PRL in mice may prove beyond their cognitive capacities, wit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(32 citation statements)
references
References 24 publications
0
29
0
Order By: Relevance
“…RL is a popular framework to model probabilistic reversal learning (Boehme et al, 2017;Chase et al, 2010;Gläscher et al, 2009;Hauser et al, 2015;Javadi et al, 2014;Metha et al, 2020;Peterson et al, 2009). RL agents choose actions based on action values that reflect actions' expected long-term cumulative reward.…”
Section: Reinforcement Learning (Rl)mentioning
confidence: 99%
See 1 more Smart Citation
“…RL is a popular framework to model probabilistic reversal learning (Boehme et al, 2017;Chase et al, 2010;Gläscher et al, 2009;Hauser et al, 2015;Javadi et al, 2014;Metha et al, 2020;Peterson et al, 2009). RL agents choose actions based on action values that reflect actions' expected long-term cumulative reward.…”
Section: Reinforcement Learning (Rl)mentioning
confidence: 99%
“…Such model-independent analyses have led to many interesting insights, but have been unable to test hypotheses about specific cognitive processes that are at work while subjects perform the task. In an effort to better understand these processes, more recent studies have started to employ computational modeling, most often in the RL framework (e.g., Boehme et al, 2017; Chase et al, 2010; Gläscher et al, 2009; Hauser et al, 2015; Javadi et al, 2014; Metha et al, 2020; Peterson et al, 2009).…”
Section: Introductionmentioning
confidence: 99%
“…Because Cadm2 has been previously associated with alcohol consumption, we used a within-subjects design to assess risky behavior under acute doses of ethanol (0, 0.5, 1g/kg), as previously described (41). In addition, we evaluated motivation, as measured by a Progressive Ratio Breakpoint task [ PBRT , (42, 43)], and behavioral flexibility, as measured by a Probabilistic Reversal Learning task [ PRL , (44)]. General exploration was measured via the Behavioral pattern monitor [ BPM , (45, 46)].…”
Section: Methodsmentioning
confidence: 99%
“…Other studies have thus referred to task design as a "bandit" task--a reinforcement learning paradigm. Within this framework, the focus is on changes in two variables: learning rate, which describes how quickly the subject incorporate new evidence into its decisions, and inverse temperature, which describes the subject's confidence in the decision (Groman et al, 2016;Metha et al, 2019;Sutton & Barto, 2018). These two approaches, WSLS and reinforcement learning, are rarely combined to understand behavioral mechanisms or neuronal measures of reversal learning (Worthy & Maddox, 2014).…”
Section: Noradrenergic Regulation Of Two-armed Bandit Performancementioning
confidence: 99%
“…Probabilistic outcomes that force animals to integrate multiple previous trials to guide choice increase the difficulty of the task. Other studies have thus referred to probabilistic reversal learning as a “bandit” task--a reinforcement learning paradigm (Groman et al, 2016; Metha et al, 2019; Sutton & Barto, 2018). These two approaches are rarely combined to understand behavioral mechanisms or neuronal measures of reversal learning (Worthy & Maddox, 2014).…”
mentioning
confidence: 99%