2022
DOI: 10.1155/2022/9485933

A Post-training Quantization Method for the Design of Fixed-Point-Based FPGA/ASIC Hardware Accelerators for LSTM/GRU Algorithms

Abstract: Recurrent Neural Networks (RNNs) have become important tools for tasks such as speech recognition, text generation, or natural language processing. However, their inference may involve up to billions of operations and their large number of parameters leads to large storage size and runtime memory usage. These reasons impede the adoption of these models in real-time, on-the-edge applications. Field-Programmable Gate Arrays (FPGAs) and Application-Specific Integrated Circuits (ASICs) have emerged as promising so…
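The paper's own algorithm is not reproduced here, but as a rough illustration of what post-training fixed-point quantization of LSTM/GRU weights involves, the Python sketch below quantizes a weight tensor to signed fixed-point codes and measures the round-trip error. The function name, the 8-bit width, and the power-of-two scale choice are assumptions for the example, not the authors' method.

```python
import numpy as np

def quantize_fixed_point(w, total_bits=8):
    """Illustrative per-tensor post-training fixed-point quantization.

    Picks the largest power-of-two scale (i.e. number of fractional bits)
    such that the tensor's maximum magnitude still fits into a signed
    `total_bits`-wide integer, then rounds the weights onto that grid.
    Returns the integer codes, the fractional-bit count, and the
    dequantized float approximation.
    """
    qmin, qmax = -(2 ** (total_bits - 1)), 2 ** (total_bits - 1) - 1
    max_abs = max(float(np.max(np.abs(w))), 1e-12)
    frac_bits = int(np.floor(np.log2(qmax / max_abs)))
    scale = 2.0 ** frac_bits
    q = np.clip(np.round(w * scale), qmin, qmax).astype(np.int32)
    return q, frac_bits, q.astype(np.float32) / scale

# Quantize a random LSTM-sized weight matrix (4 gates, hidden + input columns)
# and check the worst-case reconstruction error.
rng = np.random.default_rng(0)
w = rng.normal(scale=0.1, size=(4 * 256, 256 + 128)).astype(np.float32)
q, frac_bits, w_hat = quantize_fixed_point(w, total_bits=8)
print(frac_bits, float(np.max(np.abs(w - w_hat))))
```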

Cited by 6 publications (4 citation statements) | References: 33 publications
“…We will also focus on enabling the acceleration of feedback loops and skip connections within MDE to achieve support for residual and recurrent layers. A post-training quantization algorithm for RNNs has already been defined in one of our previous works [78].…”
Section: Comparison With Related Work (mentioning, confidence: 99%)
“…Weight pruning is used to reduce weight parameters, which can effectively compress network models and improve network performance. In order to solve the problem of large storage demand and network performance degradation caused by a large number of parameters when RNN is applied to natural language processing, the authors of [154] focused on the computing resource demand of RNN and adopted fixed-point quantization technology in order to design an FPGA accelerator, which reduced the memory consumption by 90%, and the accuracy loss was less than 1%.…”
Section: FPGA Accelerator for Natural Language Processing (mentioning, confidence: 99%)
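As a companion to the fixed-point quantization described in the statement above (a generic sketch, not the accelerator design in [154]), the snippet below shows how a fixed-point datapath typically evaluates a dot product: integer multiplies, a wide accumulator, and a final shift back to the output format. The bit widths and the function name are assumptions for the example.

```python
import numpy as np

def fixed_point_dot(x_q, w_q, x_frac_bits, w_frac_bits, out_frac_bits, acc_bits=32):
    """Integer-only dot product, in the style of a fixed-point MAC unit.

    `x_q` and `w_q` hold fixed-point codes with `x_frac_bits` and
    `w_frac_bits` fractional bits. Each product carries the sum of both
    fractional-bit counts; the accumulated result is shifted back to
    `out_frac_bits` and saturated to the `acc_bits`-wide register.
    """
    acc = int(np.sum(x_q.astype(np.int64) * w_q.astype(np.int64)))  # wide accumulator
    shift = x_frac_bits + w_frac_bits - out_frac_bits
    acc = acc >> shift if shift >= 0 else acc << -shift             # rescale by shifting
    lim = 2 ** (acc_bits - 1)
    return max(-lim, min(acc, lim - 1))                             # saturate like hardware
```

Dequantizing the output is then just a division by 2**out_frac_bits, so only integer arithmetic and shifts are needed on the device.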
“…Different from [154], which quantizes the input data, some scholars have devoted themselves to NLP task optimization based on the BERT (bidirectional encoder representation from transformers) network model [155] and have adopted the idea of full quantization to design an accelerator. Not only the input data but also the weights, activations, Softmax, layer normalization, and all intermediate results are quantized in order to compress the network and improve performance [156].…”
Section: FPGA Accelerator for Natural Language Processing (mentioning, confidence: 99%)
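For the fully quantized setting described in the statement above, where activations and intermediate results are quantized rather than only weights, a common post-training ingredient is a calibration pass that fixes each activation scale from a few sample batches. The sketch below is a generic illustration of that idea under the same power-of-two-scale assumption as earlier; it is not the design of [156], and both function names are hypothetical.

```python
import numpy as np

def calibrate_activation_frac_bits(calib_batches, total_bits=8):
    """Choose fractional bits for an activation tensor from calibration data.

    Tracks the largest activation magnitude seen over the calibration
    batches and returns the largest power-of-two scale that keeps that
    value inside the signed `total_bits` range.
    """
    qmax = 2 ** (total_bits - 1) - 1
    max_abs = max(float(np.max(np.abs(b))) for b in calib_batches)
    return int(np.floor(np.log2(qmax / max(max_abs, 1e-12))))

def quantize_activation(a, frac_bits, total_bits=8):
    """Quantize a runtime activation tensor with the calibrated fractional bits."""
    qmin, qmax = -(2 ** (total_bits - 1)), 2 ** (total_bits - 1) - 1
    return np.clip(np.round(a * 2.0 ** frac_bits), qmin, qmax).astype(np.int32)
```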
“…This article has been retracted by Hindawi, as publisher, following an investigation undertaken by the publisher [1]. This investigation has uncovered evidence of systematic manipulation of the publication and peer-review process.…”
mentioning, confidence: 99%