2015
DOI: 10.3389/fncom.2015.00036
|View full text |Cite
|
Sign up to set email alerts
|

RM-SORN: a reward-modulated self-organizing recurrent neural network

Abstract: Neural plasticity plays an important role in learning and memory. Reward-modulation of plasticity offers an explanation for the ability of the brain to adapt its neural activity to achieve a rewarded goal. Here, we define a neural network model that learns through the interaction of Intrinsic Plasticity (IP) and reward-modulated Spike-Timing-Dependent Plasticity (STDP). IP enables the network to explore possible output sequences and STDP, modulated by reward, reinforces the creation of the rewarded output sequ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
12
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 23 publications
0
12
0
Order By: Relevance
“…With these three rules at play, we exposed the model to patterned sensory stimulation that imitated retinotopic inputs from the eye ( Figure 7A). We hypothesized that during collision avoidance in real tadpoles, STDP-driven changes in the tectum may be controlled and amplified by a global learning signal that arrives if collision avoidance was unsuccessful (Savin and Triesch, 2014;Aswolinskiy and Pipa, 2015), and originates either in dimming receptors in the retina (Baranauskas et al, 2012), or in mechanosensory systems of the hindbrain Felch et al, 2016;Truszkowski et al, 2017). This approach is known as the eligibility trace model, in which changes in synaptic weights do not happen immediately, but are first "remembered" by each cell as potential changes, and are only implemented in response to a timely reinforcement signal (Seung, 2003).…”
Section: Developmental Modelmentioning
confidence: 99%
“…With these three rules at play, we exposed the model to patterned sensory stimulation that imitated retinotopic inputs from the eye ( Figure 7A). We hypothesized that during collision avoidance in real tadpoles, STDP-driven changes in the tectum may be controlled and amplified by a global learning signal that arrives if collision avoidance was unsuccessful (Savin and Triesch, 2014;Aswolinskiy and Pipa, 2015), and originates either in dimming receptors in the retina (Baranauskas et al, 2012), or in mechanosensory systems of the hindbrain Felch et al, 2016;Truszkowski et al, 2017). This approach is known as the eligibility trace model, in which changes in synaptic weights do not happen immediately, but are first "remembered" by each cell as potential changes, and are only implemented in response to a timely reinforcement signal (Seung, 2003).…”
Section: Developmental Modelmentioning
confidence: 99%
“…This finding has led to numerous studies investigating the role of synaptic scaling in controlling neural network activity (Turrigiano et al, 1998 ; Turrigiano and Nelson, 2000 ; Turrigiano, 2008 ) and in stabilizing other plasticity mechanisms (van Rossum et al, 2000 ; Stellwagen and Malenka, 2006 ; Tetzlaff, 2011 ; Toyoizumi et al, 2014 ). Indeed, synaptic scaling has proven successful in stabilizing activity in recurrent neural networks (Lazar et al, 2009 ; Remme and Wadman, 2012 ; Zenke et al, 2013 ; Effenberger and Jost, 2015 ; Miner and Triesch, 2016 ). However, these studies either used synaptic scaling as the sole homeostatic mechanism (Remme and Wadman, 2012 ; Zenke et al, 2013 ) or resorted to a variant of synaptic scaling where the scaling is not dynamically determined through a control loop using a particular target activity, but rather by a fixed multiplicative normalization rule (Lazar et al, 2009 ; Effenberger and Jost, 2015 ; Miner and Triesch, 2016 ).…”
Section: Introductionmentioning
confidence: 99%
“…RL has a long tradition in the field of machine learning which has led to several powerful algorithms, such as SARSA and Q-learning (Watkins, 1989). Similarly, a large variety of neurobiological models have been proposed in recent years (Izhikevich, 2007; Potjans et al, 2009, 2011; Urbanczik and Senn, 2009; Vasilaki et al, 2009; Frémaux et al, 2010; Frémaux et al, 2013; Jitsev et al, 2012; Friedrich et al, 2014; Rasmussen and Eliasmith, 2014; Aswolinskiy and Pipa, 2015; Baladron and Hamker, 2015; Rombouts et al, 2015; Friedrich and Lengyel, 2016; Rueckert et al, 2016). However, only a small proportion of these rely on publicly available simulators and all of them employ custom built environments.…”
Section: Introductionmentioning
confidence: 99%