2021
DOI: 10.1016/j.sysarc.2020.101839
|View full text |Cite
|
Sign up to set email alerts
|

A survey On hardware accelerators and optimization techniques for RNNs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(19 citation statements)
references
References 28 publications
0
19
0
Order By: Relevance
“…After applying the CNN acceleration, the RNN became the new bottleneck to be targeted for acceleration. In existing works, RNNs are also candidates for acceleration on FPGA [29], and their accelerators present better energy efficiency than CPU/GPU [30]. Unlike these works, our challenge is to achieve an efficient implementation alongside the DPU and extend the HW/SW interface to integrate the RNN in the Vitis AI environment.…”
Section: Discussionmentioning
confidence: 99%
“…After applying the CNN acceleration, the RNN became the new bottleneck to be targeted for acceleration. In existing works, RNNs are also candidates for acceleration on FPGA [29], and their accelerators present better energy efficiency than CPU/GPU [30]. Unlike these works, our challenge is to achieve an efficient implementation alongside the DPU and extend the HW/SW interface to integrate the RNN in the Vitis AI environment.…”
Section: Discussionmentioning
confidence: 99%
“…This includes the training of different types of layers, such as CNN (Convolutional Neural Network) or RNN (Recurrent Neural Network) layers. An overview of CNN inference accelerators on FPGA can be found, for example, in [1], while a survey of accelerators for recurrent neural networks, including LSTMs, can be found in [50]. In addition to publications that focus on the acceleration of DNN inference, some publications tackle the problem of implementing backpropagation for neural network training on FPGAs as well.…”
Section: Neural Network In Fpga-based Drl Implementationsmentioning
confidence: 99%
“…In hardware, over 70 implementations on ASIC, FPGA, or GPU/CPU have been demonstrated in Ref. [60], but only two were applied to action recognition using video footage [61], [62]. To our knowledge, no RNN for animal behavior monitoring on hardware has been described.…”
Section: Proposed Cow Behavior Distribution Estimating Neural Network...mentioning
confidence: 99%