Lose-Shift Responding in Humans Is Promoted by Increased Cognitive Load

Ivan, Victorita E.; Banks, Parker; Goodfellow, Kris; Gruber, Aaron J.

doi:10.3389/fnint.2018.00009

Cited by 15 publications

(19 citation statements)

References 59 publications


“…Potentially, the disadvantageous choices could be caused solely by negative outcomes of a preceding advantageous choice, which could prompt the participants to immediately change their preference (Gaffan & Davies, 1981; Ivan, Banks, Goodfellow, & Gruber, 2018), i.e., to follow a Win-Stay Lose-Shift strategy (see e.g., Ellerby & Tunney, 2017). In order to check whether the disadvantageous choices were mainly caused by a previous loss, we investigated probabilities of transitions leading from the advantageous to the disadvantageous choice.…”

Section: Trial Selection Criteria and The Factors Of Interest (mentioning)

confidence: 99%

Pupil Dilation and Response Slowing Distinguish Deliberate Explorative Choices in the Probabilistic Learning Task

Kozunova, Sayfulina, Prokofyev, et al. 2021

Preprint


This study examined whether pupil size and response time would distinguish directed exploration from random exploration and exploitation. Eighty-nine participants performed the two-choice probabilistic learning task while their pupil size and response time were continuously recorded. Using LMM analysis, we estimated differences in the pupil size and response time between the advantageous and disadvantageous choices as a function of learning success, i.e., whether or not a participant had learned the probabilistic contingency between choices and their outcomes. We proposed that before the true value of each choice became known to a decision-maker, both advantageous and disadvantageous choices represented a random exploration of the two options with an equally uncertain outcome, whereas the same choices after learning manifested exploitation and directed exploration strategies, respectively. We found that disadvantageous choices were associated with increases both in response time and pupil size, but only after the participants had learned the choice-reward contingencies. For the pupil size, this effect was strongly amplified for those disadvantageous choices that immediately followed gains as compared to losses in the preceding choice. Pupil size modulations were evident during the behavioral choice rather than during the pretrial baseline. These findings suggest that occasional disadvantageous choices, which violate the acquired internal utility model, represent directed exploration. This exploratory strategy shifts choice priorities in favor of information seeking, and its autonomic and behavioral concomitants are mainly driven by the conflict between the behavioral plan of the intended exploratory choice and its strong alternative, which has already proven to be more rewarding.



“…As such, the internal phasic state generated by positive outcomes along with the external experience of switching opponents makes it more likely that better performance will be expressed. In addition to extending time between trials [58], switching opponents can be added to the toolkit that helps to reduce the expression of typical reinforcement learning rules, thereby helping individuals who may be at their most cognitively vulnerable within competitive environments.…”

Section: Discussion (mentioning)

confidence: 99%

Switching Competitors Reduces Win-Stay but Not Lose-Shift Behaviour: The Role of Outcome-Action Association Strength on Reinforcement Learning

Srihaput, Craplewe, Dyson 2020

Games


Predictability is a hallmark of poor-quality decision-making during competition. One source of predictability is the strong association between current outcome and future action, as dictated by the reinforcement learning principles of win–stay and lose–shift. We tested the idea that predictability could be reduced during competition by weakening the associations between outcome and action. To do this, participants completed a competitive zero-sum game in which the opponent from the current trial was either replayed (opponent repeat), thereby strengthening the association, or replaced (opponent change) by a different competitor, thereby weakening the association. We observed that win–stay behavior was reduced during opponent change trials but lose–shift behavior remained reliably predictable. Consistent with the group data, the number of individuals who exhibited predictable behavior following wins decreased for opponent change relative to opponent repeat trials. Our data show that future actions are more under internal control following positive relative to negative outcomes, and that externally breaking the bonds between outcome and action via opponent association also allows us to become less prone to exploitation.
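The win–stay and lose–shift measures named in this abstract can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the function name `wsls_rates`, the input layout (one choice label and one signed outcome per trial), and the decision to ignore draws (outcome 0) are all assumptions made for illustration.

```python
from typing import Hashable, Optional, Sequence, Tuple


def wsls_rates(
    choices: Sequence[Hashable],
    outcomes: Sequence[float],
) -> Tuple[Optional[float], Optional[float]]:
    """Return (win_stay_rate, lose_shift_rate) for one participant.

    choices[t]  : the option picked on trial t (any hashable label).
    outcomes[t] : > 0 for a win, < 0 for a loss, 0 for a draw (assumed coding).
    A rate is None when no trial of that outcome type has a following trial.
    """
    if len(choices) != len(outcomes):
        raise ValueError("choices and outcomes must have the same length")

    wins = stays_after_win = 0
    losses = shifts_after_loss = 0

    # Compare each trial with the next one; the last trial has no successor.
    for t in range(len(choices) - 1):
        stayed = choices[t + 1] == choices[t]
        if outcomes[t] > 0:
            wins += 1
            stays_after_win += int(stayed)
        elif outcomes[t] < 0:
            losses += 1
            shifts_after_loss += int(not stayed)
        # Draws (outcome == 0) are skipped in this sketch.

    win_stay = stays_after_win / wins if wins else None
    lose_shift = shifts_after_loss / losses if losses else None
    return win_stay, lose_shift


if __name__ == "__main__":
    # Hypothetical five-trial sequence, for illustration only.
    print(wsls_rates(["R", "R", "P", "P", "S"], [1, -1, 1, -1, 1]))
```

A participant whose rates both sit near 1 is highly predictable in the sense the abstract describes; lower rates indicate a weaker link between the previous outcome and the next action.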


“…While the data from Experiment 2 suggest a somewhat pessimistic (inhibitory) role of interruption during competition, it is clear that similar kinds of decision-making might be naturally interrupted after the experience of a positive outcome either by the player in the form of post-reinforcement pausing (e.g., Dixon & Schreiber, 2004; Dyson et al, 2018; Verbruggen et al, 2017; Forder & Dyson, 2016) or by the opponent itself such as in the case of slot machines where longer music tends to play when the win is bigger (e.g., Dixon et al, 2013). This leads us to the intriguing possibility that mandatory pauses following negative outcomes during play conditions might help to break the cyclical poorer-quality decision making characterised in problem gambling (see Ivan, Banks, Goodfellow & Gruber, 2018, for a similar suggestion).…”

Section: Discussion (mentioning)

confidence: 99%

Behavioural and neural limits in competitive decision making: The roles of outcome, opponency and observation

Dyson, Steward, Meneghetti, et al. 2019

Preprint


To understand the boundaries we set for ourselves in terms of environmental responsibility during competition, we examined a neural index of outcome valence (feedback-related negativity; FRN) in relation to earlier indices of visual attention (N1), later indices of motivational significance (P3), and eventual behaviour. In Experiment 1 (n=36), participants either were (play) or were not (observe) responsible for action selection. In Experiment 2 (n=36), opponents additionally either could (exploitable) or could not (unexploitable) be beaten. Various failures in reinforcement learning expression were revealed, including large-scale approximations of random behaviour. Against unexploitable opponents, N1 determined the extent to which negative and positive outcomes were perceived as distinct categories by FRN. Against exploitable opponents, FRN determined the extent to which P3 generated neural gain for future events. Differential activation of the N1-FRN-P3 processing chain provides a framework for understanding the behavioural dynamism observed during competitive decision making.
