2018
DOI: 10.1145/3186332
|View full text |Cite
|
Sign up to set email alerts
|

Toolflows for Mapping Convolutional Neural Networks on FPGAs

Abstract: In the past decade, Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art performance in various Artificial Intelligence tasks. To accelerate the experimentation and development of CNNs, several software frameworks have been released, primarily targeting power-hungry CPUs and GPUs. In this context, reconfigurable hardware in the form of FPGAs constitutes a potential alternative platform that can be integrated in the existing deep learning ecosystem to provide a tunable balance between perform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
72
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 153 publications
(74 citation statements)
references
References 87 publications
1
72
0
1
Order By: Relevance
“…Moreover, [53] and [54] presented automated frameworks specifically tailored for FPGA-based binarised and spiking neural networks respectively, while [55] proposed a library for the mapping of ConvNets on diverse embedded platforms, together with a comparative study of their design spaces. Finally, [12] provides a detailed survey of ConvNet-to-FPGA toolflows.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, [53] and [54] presented automated frameworks specifically tailored for FPGA-based binarised and spiking neural networks respectively, while [55] proposed a library for the mapping of ConvNets on diverse embedded platforms, together with a comparative study of their design spaces. Finally, [12] provides a detailed survey of ConvNet-to-FPGA toolflows.…”
Section: Related Workmentioning
confidence: 99%
“…Nevertheless, several issues increase the complexity of Con-vNet system development on FPGAs [12]. With FPGAs' size and resource specifications advancing at a fast pace and with ConvNets becoming more complex, the possible mappings of a ConvNet on an FPGA lie on a large multidimensional design space that cannot be explored manually.…”
Section: Introductionmentioning
confidence: 99%
“…while the number of elements included in the unrolled sliding window for all channels of the input feature map volume is P = K H K W N IN (7). In the case of FC Layers, batching is employed to form a similar R F C ×P matrix, each row of which contains the input feature vector of size P for a different input sample, and hence: R FC = BatchSize (8).…”
Section: Architecturementioning
confidence: 99%
“…In this context, FPGAs constitute a promising platform for CNN inference due to their customisability which enables the use of optimised low-precision arithmetic units to achieve performance gains [7]. Existing FPGA-based accelerators have produced hardware designs that span from uniform 16-bit precision [8] [9] with minimal effect on accuracy, down to very high-performance binarised networks [10], but at a significant accuracy loss.…”
Section: Introductionmentioning
confidence: 99%
“…The standard 2D convolution layers, from which the CNN is constructed, occupy over 90% of the overall processing time [17] and their latency T i on the accelerator needs to be estimated to determine the best hardware configuration through DSE. For 2D convolution, there are several categories of parallelism including filter parallelism (P F ) or channel parallelism (P C) in addition to spatial and kernel parallelisms.…”
Section: Introductionmentioning
confidence: 99%