2016
DOI: 10.1073/pnas.1523669113
|View full text |Cite
|
Sign up to set email alerts
|

Credit assignment in movement-dependent reinforcement learning

Abstract: When a person fails to obtain an expected reward from an object in the environment, they face a credit assignment problem: Did the absence of reward reflect an extrinsic property of the environment or an intrinsic error in motor execution? To explore this problem, we modified a popular decision-making task used in studies of reinforcement learning, the two-armed bandit task. We compared a version in which choices were indicated by key presses, the standard response in such tasks, to a version in which the choi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

8
87
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 74 publications
(95 citation statements)
references
References 37 publications
8
87
0
Order By: Relevance
“…Overall, this bandit was chosen on 39.49% (±9.38) of the trials, which was significantly greater than the Low Execution/High Selection error bandit (29.01%; ±6.89%, p < .001) and Neutral bandit (31.50%; ±8.64%, p = .046), with no difference for the latter two (p = .877). Consistent with previous work, when expected value is equal, participants prefer choices in which unrewarded trials are attributed to errors in movement execution rather than errors in action selection (Wu et al, 2009;Green et al, 2010;McDougle et al, 2016;Parvin et al, 2018;McDougle et al, 2019).…”
Section: Experiments 2: Behavioral Responsessupporting
confidence: 84%
See 2 more Smart Citations
“…Overall, this bandit was chosen on 39.49% (±9.38) of the trials, which was significantly greater than the Low Execution/High Selection error bandit (29.01%; ±6.89%, p < .001) and Neutral bandit (31.50%; ±8.64%, p = .046), with no difference for the latter two (p = .877). Consistent with previous work, when expected value is equal, participants prefer choices in which unrewarded trials are attributed to errors in movement execution rather than errors in action selection (Wu et al, 2009;Green et al, 2010;McDougle et al, 2016;Parvin et al, 2018;McDougle et al, 2019).…”
Section: Experiments 2: Behavioral Responsessupporting
confidence: 84%
“…Behaviorally, participants showed lower switch rates following execution errors, a pattern consistent with the hypothesis that the reinforcement learning system discounts these errors (McDougle et al, 2016;. Contrary to our prediction that the MFN would be attenuated following execution errors, these errors produced the largest MFNs.…”
Section: Differential Error Processing Indexed By the Mfnsupporting
confidence: 83%
See 1 more Smart Citation
“…Another, more probable explanation is that minimally invasive surgeons are exploiting additional resources such as explicit strategies or heuristics to adapt faster and more completely than controls. Although a long-standing view has been that visuomotor adaptation reflects incremental (i.e., trial-by-trial) error-based learning that occurs in an implicit and automatic manner, recent experimental and computational modelling work indicates that it often depends on the operation of multiple interacting learning processes [32,33]. For example, if a perturbation induces a large reaching error to the right side of the target, subjects experiencing this error signal might consciously choose to aim to the left side of the target on the next trial in order to compensate.…”
Section: Discussionmentioning
confidence: 99%
“…For example, selecting an appropriate trajectory for reaching a glass of water can lead to a low risk of spilling water, and likewise, finding a running course to easily pass through in rugby and deciding the best aiming location in a tennis match can increase the possibility of victory in a competition. Despite the importance of optimal decision-making, for over a decade, sub-optimal and overly risk-seeking behaviours have been reported in various motor decision tasks [1][2][3][4][5][6][7][8][9] . Determining how to improve sub-optimal and risk-seeking decision-making behaviour is crucial to enhance well-being in daily life and performance in sports.…”
Section: Introductionmentioning
confidence: 99%