2019 IEEE 62nd International Midwest Symposium on Circuits and Systems (MWSCAS)
DOI: 10.1109/mwscas.2019.8884910

Accelerating Deterministic and Stochastic Binarized Neural Networks on FPGAs Using OpenCL

Abstract: Recent technological advances have proliferated the available computing power, memory, and speed of modern Central Processing Units (CPUs), Graphics Processing Units (GPUs), and Field Programmable Gate Arrays (FPGAs). Consequently, the performance and complexity of Artificial Neural Networks (ANNs) are burgeoning. While GPU-accelerated Deep Neural Networks (DNNs) currently offer state-of-the-art performance, they consume large amounts of power. Training such networks on CPUs is inefficient, as data throughput a…

Cited by 5 publications (3 citation statements)
References 6 publications
“…These tools allow engineers to describe their targeted hardware in high-level programming languages such as C and synthesize it to the Register Transfer Level (RTL). The tools then offload the computationally critical RTL to run as kernels on parallel processing platforms such as FPGAs [91].…”
Section: B. FPGA DNNs
confidence: 99%
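The HLS/OpenCL flow this statement describes can be illustrated with a minimal kernel sketch. The example below is hypothetical and not taken from the paper: it expresses a binarized dot product as an XNOR-popcount OpenCL kernel of the kind such toolchains synthesize to RTL and offload to an FPGA. The kernel name bnn_dot, the 32-bit packing, and the argument layout are all assumptions.

// Hypothetical OpenCL kernel: binarized dot product via XNOR + popcount.
// Weights and activations are packed 32 values per uint; a multiply-
// accumulate over {-1, +1} reduces to popcount(~(w ^ a)) per word.
__kernel void bnn_dot(__global const uint *weights,     // packed binary weights
                      __global const uint *activations, // packed binary inputs
                      __global int *out,                // one dot product per work-item
                      const uint words_per_row)         // packed words per output neuron
{
    const size_t row = get_global_id(0);
    int acc = 0;
    for (uint i = 0; i < words_per_row; ++i) {
        // XNOR marks agreeing bit positions; popcount counts the +1 products.
        uint agree = ~(weights[row * words_per_row + i] ^ activations[i]);
        acc += (int)popcount(agree);
    }
    // Map the match count back to a signed sum over {-1, +1} products.
    out[row] = 2 * acc - (int)(32u * words_per_row);
}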
“…Binarization has been used to augment the performance of Deep Neural Networks (DNNs) by quantizing network parameters to binary states, replacing many resource-hungry multiply-accumulate operations with simple accumulations [1]. It has been demonstrated that Binarized Neural Networks (BNNs) implemented on customized hardware can perform inference faster than conventional DNNs on state-of-the-art Graphics Processing Units (GPUs) [2], [3], while offering notable improvements in power consumption and resource utilization [4]-[6]. However, there is still a performance gap between DNNs and conventional BNNs [7], which binarize parameters deterministically or stochastically.…”
Section: Introduction
confidence: 99%
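For concreteness, here is a minimal C sketch of the two binarization rules named in this statement, as they commonly appear in the BNN literature: the deterministic rule takes the sign of a parameter, while the stochastic rule samples +1 with a hard-sigmoid probability. The function names are illustrative, and the hard-sigmoid formulation follows the usual presentation rather than anything stated in this paper.

#include <stdlib.h>

/* Deterministic binarization: the sign function maps a real-valued
 * parameter to {-1, +1}. */
static int binarize_det(float w)
{
    return (w >= 0.0f) ? +1 : -1;
}

/* Hard sigmoid, clip((w + 1) / 2, 0, 1), used as the probability of
 * sampling +1 in the stochastic rule. */
static float hard_sigmoid(float w)
{
    float p = (w + 1.0f) * 0.5f;
    if (p < 0.0f) p = 0.0f;
    if (p > 1.0f) p = 1.0f;
    return p;
}

/* Stochastic binarization: sample +1 with probability hard_sigmoid(w). */
static int binarize_stoch(float w)
{
    float u = (float)rand() / (float)RAND_MAX; /* uniform in [0, 1] */
    return (u < hard_sigmoid(w)) ? +1 : -1;
}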
“…Binarized Neural Networks (BNNs) [2], which perform binary MAC computations during forward and backward propagation, have demonstrated comparable performance to conventional DNNs while significantly reducing resource and power utilization [3]. Owing to endurance concerns, ReRAM devices are ill-suited to implementing the backward propagations required during BNN training, where a large number of programming cycles is needed.…”
Section: Introduction
confidence: 99%
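The binary MAC mentioned in this statement consumes bit-packed parameters. The hypothetical helper below shows one way binarized values could be packed into the 32-bit words consumed by the earlier bnn_dot sketch; the bit ordering (LSB-first, mapping +1 to 1 and -1 to 0) and the requirement that n be a multiple of 32 are assumptions.

#include <stdint.h>
#include <stddef.h>

/* Hypothetical helper: pack binarized values {-1, +1} into 32-bit words,
 * mapping +1 -> bit 1 and -1 -> bit 0, LSB first. n must be a multiple
 * of 32 (pad shorter rows with -1). */
static void pack_bits(const int *bits, size_t n, uint32_t *packed)
{
    for (size_t i = 0; i < n / 32; ++i) {
        uint32_t word = 0;
        for (int b = 0; b < 32; ++b) {
            if (bits[i * 32 + b] > 0)
                word |= (uint32_t)1 << b;
        }
        packed[i] = word;
    }
}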