Boosting Throughput and Efficiency of Hardware Spiking Neural Accelerators Using Time Compression Supporting Multiple Spike Codes

Xu, Changqing; Zhang, Wenrui; Liu, Yu

doi:10.3389/fnins.2020.00104

Cited by 18 publications

(12 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Inspired by human neurons' working patterns, spiking neural networks (SNNs) are considered as the third generation artificial neural network [1]. With the development of SNNs, a large range of applications have been demonstrated including image classification [2][3], video processing [4] [5], posture and gesture recognition [6] [7], voice recognition [8] [9]. Compared with traditional artificial neural networks (ANNs) which consist of static and continuous-valued neuron models, spiking neural networks (SNNs) have a unique event-driven computation characteristic that can respond to the events in a nearly latency-free and power-saving way [10] [11], and it is naturally more suitable for processing event stream class.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Ultra-low Latency Spiking Neural Networks with Spatio-Temporal Compression and Synaptic Convolutional Block

Cai¹,

Liu²,

Yang³

2022

Preprint

Self Cite

View full text Add to dashboard Cite

Spiking neural networks (SNNs), as one of the brain-inspired models, has spatiotemporal information processing capability, low power feature, and high biological plausibility. The effective spatio-temporal feature makes it suitable for event streams classification. However, neuromorphic datasets, such as N-MNIST, CIFAR10-DVS, DVS128-gesture, need to aggregate individual events into frames with a new higher temporal resolution for event stream classification, which causes high training and inference latency. In this work, we proposed a spatio-temporal compression method to aggregate individual events into a few time steps of synaptic current to reduce the training and inference latency. To keep the accuracy of SNNs under high compression ratios, we also proposed a synaptic convolutional block to balance the dramatic changes between adjacent time steps. And multi-threshold Leaky Integrate-and-Fire (LIF) models with learnable membrane time constants are introduced to increase its information processing capability. We evaluate the proposed method for event streams classification tasks on neuromorphic N-MNIST, CIFAR10-DVS, DVS128 gesture datasets. The experiment results show that our proposed method outperforms the state-of-the-art accuracy on nearly all datasets, using fewer time steps.

show abstract

Section: Introductionmentioning

confidence: 99%

“…Otherwise, the accuracy of the SNNs will drop significantly. In [7], a temporal compression method is proposed which can reduce the length of event streams by shrinking the duration of the input event trains. However, this method is only applied to the trained SNNs, which limits its potential.…”

Section: Introductionmentioning

confidence: 99%

Ultra-low Latency Spiking Neural Networks with Spatio-Temporal Compression and Synaptic Convolutional Block

Cai¹,

Liu²,

Yang³

2022

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…computing systems [2]. For instance, IBM's TrueNorth [3] and Intel's Loihi [4] process a single spike with a few pJ of energy.…”

mentioning

confidence: 99%

“…In recent years, many researchers have focused on the problem of the latency of SNNs and tried to solve the problems we mentioned above. Some researchers try to propose novel encoding methods to improve the efficiency of information representation to reduce the latency of SNNs [2], [6]- [8]. In [8], authors proposed a phase coding method to encode input spikes by the phase of a global reference clock and achieve latency reduction over the rate coding for image recognition.…”

mentioning

confidence: 99%

“…Compared with rate coding, the proposed method can reduce the number of time steps from hundreds to tens. In [2], authors proposed a time compression method to compress the temporal domain of spike training which can achieve up to 16× speedup with little accuracy loss. However, the achievable latency/spike reduction of a particular code can vary widely with network architecture and application.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Direct Training via Backpropagation for Ultra-low Latency Spiking Neural Networks with Multi-threshold

Cai¹,

Liu²,

Yang³

2021

Preprint

Self Cite

View full text Add to dashboard Cite

Spiking neural networks (SNNs) can utilize spatio-temporal information and have a nature of energy efficiency which is a good alternative to deep neural networks(DNNs). The event-driven information processing makes SNNs can reduce the expensive computation of DNNs and save a lot of energy consumption. However, high training and inference latency is a limitation of the development of deeper SNNs. SNNs usually need tens or even hundreds of time steps during the training and inference process which causes not only the increase of latency but also the waste of energy consumption. To overcome this problem, we proposed a novel training method based on backpropagation (BP) for ultra-low latency(1-2 time steps) SNN with multi-threshold. In order to increase the information capacity of each spike, we introduce the multithreshold Leaky Integrate and Fired (LIF) model. In our proposed training method, we proposed three approximated derivative for spike activity to solve the problem of the non-differentiable issue which cause difficulties for direct training SNNs based on BP. The experimental results show that our proposed method achieves an average accuracy of 99.56%, 93.08%, and 87.90% on MNIST, FashionMNIST, and CIFAR10, respectively with only 2 time steps. For the CIFAR10 dataset, our proposed method achieve 1.12% accuracy improvement over the previously reported direct trained SNNs with fewer time steps.

show abstract

Direct training of hardware-friendly weight binarized spiking neural network with surrogate gradient learning towards spatio-temporal event-based dynamic data recognition

et al. 2021

View full text Add to dashboard Cite

Boosting Throughput and Efficiency of Hardware Spiking Neural Accelerators Using Time Compression Supporting Multiple Spike Codes

Cited by 18 publications

References 22 publications

Ultra-low Latency Spiking Neural Networks with Spatio-Temporal Compression and Synaptic Convolutional Block

Ultra-low Latency Spiking Neural Networks with Spatio-Temporal Compression and Synaptic Convolutional Block

Direct Training via Backpropagation for Ultra-low Latency Spiking Neural Networks with Multi-threshold

Direct training of hardware-friendly weight binarized spiking neural network with surrogate gradient learning towards spatio-temporal event-based dynamic data recognition

Contact Info

Product

Resources

About