Memory decay and generalization following distinct motor learning mechanisms

Bao, Shancheng; Lei, Yuming

doi:10.1152/jn.00105.2022

Cited by 9 publications

(10 citation statements)

References 96 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Reinforcement learning also impacts motor memories. Implicit aftereffects, defined as motor memories not under conscious control ( Krakauer et al, 2019 ), are strengthened when reinforcement is combined with sensorimotor adaptation ( Huang et al, 2011 ; Shmuelof et al, 2012 ; Galea et al, 2015 ) or use-dependent learning ( Mawase et al, 2017 ; Bao and Lei, 2022 ; c.f., Tsay et al, 2022 ). Reinforcement learning may also strengthen explicit retention, defined as the ability to consciously remember and reproduce a previously learned movement ( Schmidt and Lee, 2005 ), potentially because individuals benefit from determining what the successful movement is themselves through exploration, improving engagement in the task, unlike when receiving full visual feedback of movements ( Winstein and Schmidt, 1990 ; Winstein et al, 1994 ; Hasson et al, 2015 ).…”

Section: Introductionmentioning

confidence: 99%

Reinforcement Learning during Locomotion

Wood,

Kim,

Morton

2024

eNeuro

View full text Add to dashboard Cite

When learning a new motor skill, people often must use trial and error to discover which movement is best. In the reinforcement learning framework, this concept is known as exploration and has been linked to increased movement variability in motor tasks. For locomotor tasks, however, increased variability decreases upright stability. As such, exploration during gait may jeopardize balance and safety, making reinforcement learning less effective. Therefore, we set out to determine if humans could acquire and retain a novel locomotor pattern using reinforcement learning alone. Young healthy male and female participants walked on a treadmill and were provided with binary reward feedback (indicated by a green checkmark on the screen) that was tied to a fixed monetary bonus, to learn a novel stepping pattern. We also recruited a comparison group who walked with the same novel stepping pattern but did so by correcting for target error, induced by providing real time veridical visual feedback of steps and a target. In two experiments, we compared learning, motor variability, and two forms of motor memories between the groups. We found that individuals in the binary reward group did, in fact, acquire the new walking pattern by exploring (increasing motor variability). Additionally, while reinforcement learning did not increase implicit motor memories, it resulted in more accurate explicit motor memories compared to the target error group. Overall, these results demonstrate that humans can acquire new walking patterns with reinforcement learning and retain much of the learning over 24 hours.Significance StatementHumans can learn some novel movements by independently discovering the actions that lead to success. This discovery process, exploration, requires increased motor variability to determine the best movement. However, in bipedal locomotion especially, increasing motor variability decreases stability, heightening the risk of negative outcomes such as a trip, injury, or fall. Despite this stability constraint, the current study shows that individuals do use exploration to find the most rewarding walking patterns. This form of learning led to improved explicit retention but not implicit aftereffects. Thus, the reinforcement learning framework can explain findings across a wide range of motor and cognitive tasks, including locomotion.

show abstract

Section: Introductionmentioning

confidence: 99%

Reinforcement Learning during Locomotion

Wood,

Kim,

Morton

2024

eNeuro

View full text Add to dashboard Cite

show abstract

Section: Introductionmentioning

confidence: 99%

“…Reinforcement learning also impacts motor memories. Implicit aftereffects, defined as motor memories not under conscious control (Krakauer et al, 2019), are strengthened when reinforcement is combined with sensorimotor adaptation (Galea et al, 2015; Huang et al, 2011; Shmuelof et al, 2012), or use-dependent learning (Mawase et al, 2017; Bao and Lei, 2022; c.f., Tsay et al, 2022). Reinforcement learning may also strengthen explicit retention, defined as the ability to consciously remember and reproduce a previously learned movement (Schmidt and Lee, 2005), potentially because individuals benefit from determining what the successful movement is themselves through exploration, improving engagement in the task, unlike when receiving full visual feedback of movements (Hasson et al, 2015; Winstein et al, 1994; Winstein and Schmidt, 1990).…”

Section: Introductionmentioning

confidence: 99%

Reinforcement learning during locomotion

Wood,

Kim,

Morton

2023

Preprint

View full text Add to dashboard Cite

When learning a new motor skill, people often must use trial and error to discover which movement is best. In the reinforcement learning framework, this concept is known as exploration and has been observed as increased movement variability in motor tasks. For locomotor tasks, however, increased variability decreases upright stability. As such, exploration during gait may jeopardize balance and safety, making reinforcement learning less effective. Therefore, we set out to determine if humans could acquire and retain a novel locomotor pattern using reinforcement learning alone. Young healthy male and female humans walked on a treadmill and were provided with binary reward feedback (success or failure only) to learn a novel stepping pattern. We also recruited a comparison group who walked with the same novel stepping pattern but did so by correcting for target error, induced by providing real time veridical visual feedback of steps and a target. In two experiments, we compared learning, motor variability, and two forms of motor memories between the groups. We found that individuals in the binary reward group did, in fact, acquire the new walking pattern by exploring (increased variability). Additionally, while reinforcement learning did not increase implicit motor memories, it resulted in more accurate explicit motor memories compared to the target error group. Overall, these results demonstrate that humans can acquire new walking patterns with reinforcement learning and retain much of the learning over 24 hours.Significance StatementHumans can learn some novel movements by independently discovering the actions that lead to success. This discovery process, exploration, requires increased motor variability to determine the best movement. However, in bipedal locomotion especially, increasing motor variability decreases stability, heightening the risk of negative outcomes such as a trip, injury, or fall. Despite this stability constraint, the current study shows that individuals do use exploration to find the most rewarding walking patterns. This form of learning led to improved explicit retention but not implicit aftereffects. Thus, the reinforcement learning framework can explain findings across a wide range of motor and cognitive tasks, including locomotion.

show abstract

“…To save time and facilitate measuring performance, experimental tasks often involve movements of fewer body parts in a setting with an instructed movement goal and in which movement success can easily be defined and measured. Most experiments involve target-directed movements with the arm (Bao & Lei, 2022;Bernardi et al, 2015;Cashaback et al, 2019;Holland et al, 2019;Holland et al, 2018;Ikegami et al, 2022;Izawa & Shadmehr, 2011;Kuling et al, 2019;Manley et al, 2014;Pekny et al, 2015;Roth et al, 2023;Shmuelof et al, 2012;Sidarta et al, 2018Sidarta et al, , 2022Therrien et al, 2016Therrien et al, , 2018van der Kooij & Overvliet, 2016; or finger (Uehara et al, 2019).…”

Section: Thinking Inside the Box: Scientific Framework On Reward-base...mentioning

confidence: 99%

“…Alternatively, feedback about limb movement is not perturbed but feedback about movement success is: external feedback can be manipulated in such a way that not all movements have an equal probability of reward, so that participants learn to move in ways that yield the most reward. Reaches that are within a visual target may for example be more frequently rewarded the more they are on the left side of the target (Cashaback et al, 2019) or only when they are in a specific zone within the visual target (Bao & Lei, 2022;Bernardi et al, 2015;Manley et al, 2014;Roth et al, 2023;Sidarta et al, 2018Sidarta et al, , 2022. In other studies, participants had to learn to correct for their own natural biases in reaching to unseen targets (Kuling et al, 2019) or the relation between their joint motion and reward (Vassiliadis et al, 2021(Vassiliadis et al, , 2022Wiegel, 2021).…”

Section: Thinking Inside the Box: Scientific Framework On Reward-base...mentioning

confidence: 99%

Exploration in reward-based motor learning

van Mastrigt

View full text Add to dashboard Cite

Our movements are variable: no movement is the same, even if you intend to make the same movement. This motor variability may allow you to find the movements that serve your movement goals best. The relation between variability and motor learning is however unclear. This may be the case because motor variability can arise from two sources: inevitable sensorimotor noise (inherent, random variability) and exploration (variability that can be controlled and can be learnt from). In this thesis, I aimed to disentangle sensorimotor noise and exploration to study the role of exploration in reward-based motor learning. We speak of this type of motor learning when learning how to move based on binary success and failure feedback: whether your movement was successful or not. In this thesis, we operationalized exploration as the additional variability following failure as compared to following success. In the first part of my thesis, I report on one experiment and one simulation study aiming to validate a method for quantifying exploration based on this operationalization. In Chapter 2, we proposed to estimate variability based on trial-to-trial changes, since standard measures of variability are sensitive to learning. We consider trial-to-trial changes following successful trials sensorimotor noise, since participants likely aim to repeat their movement in that situation. We proposed a trial-to-trial change (TTC) method for quantifying exploration as the additional trial-to-trial change following failed trials relative to the trial-to-trial change following successful trials. In Chapter 3, we aimed to validate the trial-to-trial change (TTC) method by simulating learning with four reward-based motor learning models and comparing the input exploration of the models to the trial-to-trial change exploration estimates of our method. Since the simulations allowed us to identify two pitfalls in quantifying exploration in reward-based motor learning, we reformulated our method to the additional trial-to-trial change (ATTC) method that is valid for one class of reward-based motor learning models and under specific circumstances. In the last part of my thesis, I report on two experiments in which I studied reward-based motor learning. In Chapter 4, I found that binary success and failure feedback can induce implicit learning, and that this implicit learning cannot be explained by use-dependent learning. In Chapter 5, I found no relation between overall motor variability, sensorimotor noise and exploration as estimated with the ATTC method on the one hand, and reward-based motor learning on the other hand. In conclusion, we operationalized the intuitive concept of exploration as the additional variability following failed movements as compared to successful movements. Even this intuitive operationalization seems to have its snags. Our ATTC method could be used for quantifying exploration under specific circumstances and under the assumption that humans control exploratory variability only based on the success of the most recent movement. A more general lesson is that intuitive concepts like exploration and implicit learning are difficult to capture once one tries to formalize them.

show abstract

Memory decay and generalization following distinct motor learning mechanisms

Cited by 9 publications

References 96 publications

Reinforcement Learning during Locomotion

Reinforcement Learning during Locomotion

Reinforcement learning during locomotion

Exploration in reward-based motor learning

Contact Info

Product

Resources

About