Dopamine neurons are proposed to signal the reward prediction error in model-free reinforcement learning algorithms. This term represents the unpredicted or 'excess' value of the rewarding event, value that is then added to the intrinsic value of any antecedent cues, contexts or events. To support this proposal, proponents cite evidence that artificiallyinduced dopamine transients cause lasting changes in behavior. Yet these studies do not generally assess learning under conditions where an endogenous prediction error would occur. Here, to address this, we conducted three experiments where we optogenetically activated dopamine neurons while rats were learning associative relationships, both with and without reward. In each experiment, the antecedent cues failed to acquire value and instead entered into associations with the later events, whether valueless cues or valued rewards. These results show that in learning situations appropriate for the appearance of a prediction error, dopamine transients support associative, rather than model-free, learning.
Reward-evoked dopamine transients are well-established as prediction errors. However the central tenet of temporal difference accounts-that similar transients evoked by reward-predictive cues also function as errors-remains untested. Here we addressed this by showing that optogenetically-shunting dopamine activity at the start of a reward-predicting cue prevents secondorder conditioning without affecting blocking. These results indicate that cue-evoked transients function as temporal-difference prediction errors rather than reward predictions.
Dopamine neurons fire transiently in response to unexpected rewards. These neural correlates are proposed to signal the reward prediction error described in model-free reinforcement learning algorithms. This error term represents the unpredicted or 'excess' value of the rewarding event. In model-free reinforcement learning, this value is then stored as part of the learned value of any antecedent cues, contexts or events, making them intrinsically valuable, independent of the specific rewarding event that caused the prediction error. In support of equivalence between dopamine transients and this model-free error term, proponents cite causal optogenetic studies showing that artificially induced dopamine transients cause lasting changes in behavior. Yet none of these studies directly demonstrate the presence of cached value under conditions appropriate for associative learning. To address this gap in our knowledge, we conducted three studies where we optogenetically activated dopamine neurons while rats were learning associative relationships, both with and without reward. In each experiment, the antecedent cues failed to acquired value and instead entered into value-independent associative relationships with the other cues or rewards. These results show that dopamine transients, constrained within appropriate learning situations, support valueless associative learning.
Multiple sclerosis (MS) is an autoimmune and neurodegenerative disease driven by inflammation and demyelination in the brain, spinal cord, and optic nerve. Optic neuritis, characterized by inflammation and demyelination of the optic nerve, is a symptom in many patients with MS. The optic nerve is the highway for visual information transmitted from the retina to the brain. It contains axons from the retinal ganglion cells (RGCs) that reside in the retina, myelin forming oligodendrocytes and resident microglia and astrocytes. Inflammation, demyelination, and axonal degeneration are also present in the optic nerve of mice subjected to experimental autoimmune encephalomyelitis (EAE), a preclinical mouse model of MS. Monitoring the optic nerve in EAE is a useful strategy to study the presentation and progression of pathology in the visual system; however, current approaches have relied on sectioning, staining and manual quantification. Further, information regarding the spatial load of lesions and inflammation is dependent on the area of sectioning. To better characterize cellular pathology in the EAE model, we employed a tissue clearing and 3D immunolabelling and imaging protocol to observe patterns of immune cell infiltration and activation throughout the optic nerve. Increased density of TOPRO staining for nuclei captured immune cell infiltration and Iba1 immunostaining was employed to monitor microglia and macrophages. Axonal degeneration was monitored by neurofilament immunolabelling to reveal axonal swellings throughout the optic nerve. In parallel, we developed a convolutional neural network with a UNet architecture (CNN-UNet) called BlebNet for automated identification and quantification of axonal swellings in whole mount optic nerves. Together this constitutes a toolkit for 3-dimensional immunostaining to monitor general optic nerve pathology and fast automated quantification of axonal defects that could also be adapted to monitor axonal degeneration and inflammation in other neurodegenerative disease models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with đź’™ for researchers
Part of the Research Solutions Family.