2019
DOI: 10.1101/800540
Preprint

Adaptive time scales in recurrent neural networks

Abstract: Recurrent neural network models have become widely used in computational neuroscience to model the dynamics of neural populations as well as in machine learning applications to model data with temporal dependencies. The different variants of RNNs commonly used in these scientific fields can be derived as discrete time approximations of the instantaneous firing rate of a population of neurons. The time constants of the neuronal process are generally ignored in these approximations, while learning these time con…
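The discrete-time approximation mentioned in the abstract can be made concrete with a short sketch. The following PyTorch example is a minimal illustration under stated assumptions, not the authors' implementation: the class name LeakyRNNCell, the Euler update h_{t+1} = (1 - dt/tau) h_t + (dt/tau) tanh(W_in x_t + W_rec h_t + b), and the log-space parameterization of the time constants tau are all assumptions introduced here for illustration.

import math
import torch
import torch.nn as nn

class LeakyRNNCell(nn.Module):
    # Euler discretization of a firing-rate model (illustrative assumption):
    #   h_{t+1} = (1 - dt/tau) * h_t + (dt/tau) * tanh(W_in x_t + W_rec h_t + b)
    # Each hidden unit gets its own time constant tau, stored as log(tau)
    # so that gradient descent keeps it positive.
    def __init__(self, input_size, hidden_size, dt=1.0, tau_init=10.0):
        super().__init__()
        self.dt = dt
        self.w_in = nn.Linear(input_size, hidden_size, bias=True)
        self.w_rec = nn.Linear(hidden_size, hidden_size, bias=False)
        self.log_tau = nn.Parameter(torch.full((hidden_size,), math.log(tau_init)))

    def forward(self, x, h):
        tau = torch.exp(self.log_tau).clamp(min=self.dt)  # keep dt/tau <= 1
        alpha = self.dt / tau
        return (1 - alpha) * h + alpha * torch.tanh(self.w_in(x) + self.w_rec(h))

# Usage: unroll over a toy sequence; log_tau receives gradients like any other weight.
cell = LeakyRNNCell(input_size=3, hidden_size=64)
h = torch.zeros(1, 64)
for x_t in torch.randn(100, 1, 3):  # 100 time steps, batch size 1
    h = cell(x_t, h)

Fixing tau equal to dt recovers the standard (Elman-style) RNN update, i.e. the limit in which the neuronal time constants are ignored; letting the network learn log_tau per unit is one way such time constants could be made adaptive.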


Cited by 6 publications (4 citation statements) · References: 31 publications
“…Our computational approach was inspired by the neurocognitive models of Botvinick, (2007) and Kiebel et al (2008), in which higher stages of cortical processing learned or controlled temporal structure at longer timescales. More generally, multiscale machine-learning architectures have been proposed for reducing the complexity of the learning problem at each scale and for representing multi-scale environments (Chung et al, 2016;Jaderberg et al, 2019;Mozer, 1992;Mujika et al, 2017;Quax et al, 2019;Schmidhuber, 1992). In neuroscience, multiple timescale representations have been proposed for learning value functions (Sutton, 1995), tracking reward (Bernacchia et al, 2011), and perceiving and controlling action (Botvinick, 2007;Paine and Tani, 2005).…”
Section: Discussion
confidence: 99%
“…A number of studies have used heterogeneous or tunable time constants (Fang et al, 2020;Quax et al, 2020;Yin et al, 2020), but these have generally been focussed on maximising performance for neuromorphic applications, and not considering the potential role in real nervous systems. In particular, we have shown that: heterogeneity is particularly important for the type of temporally complex tasks faced in real environments, as compared to the static ones often considered in machine learning; heterogeneity confers robustness allowing for learning in a wide range of environments; optimal distributions of time constants are consistent across training runs and match experimental data; and that our results are not specific to a particular task or training method.…”
Section: Discussion
confidence: 99%
“…As it will be shown below, the usage of several stacked convolutional layers in CNNs, used to create a hierarchy of progressively more abstract representations of the input data, constitutes their main strength for mining spatial dependencies. Similarly, recurrent neural networks (RNNs) are the most suitable neural network typology for modeling temporal dependencies in time‐series data 21 …”
Section: The Anomaly Detection Framework
confidence: 99%