2017 27th International Conference on Field Programmable Logic and Applications (FPL) 2017
DOI: 10.23919/fpl.2017.8056828
|View full text |Cite
|
Sign up to set email alerts
|

Latency-driven design for FPGA-based convolutional neural networks

Abstract: Abstract-In recent years, Convolutional Neural Networks (ConvNets) have become the quintessential component of several state-of-the-art Artificial Intelligence tasks. Across the spectrum of applications, the performance needs vary significantly, from high-throughput image recognition to the very low-latency requirements of autonomous cars. In this context, FPGAs can provide a potential platform that can be optimally configured based on different performance requirements. However, with the increasing complexity… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
56
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
3

Relationship

2
7

Authors

Journals

citations
Cited by 80 publications
(56 citation statements)
references
References 16 publications
0
56
0
Order By: Relevance
“…mean) in the error tolerance range of 1 to 5%. In the same error range, Cas-cacdeCNN also outperforms the works of [20] and [30] by up to 8.29× and 24.83× respectively. Similarly for VGG-16, the full cascade architecture outperforms the mixed OpenCL-RTL design of [29] by up to 2.73× with an average of 2.36× (geo.…”
Section: Comparison With Existing Fpga Workmentioning
confidence: 87%
“…mean) in the error tolerance range of 1 to 5%. In the same error range, Cas-cacdeCNN also outperforms the works of [20] and [30] by up to 8.29× and 24.83× respectively. Similarly for VGG-16, the full cascade architecture outperforms the mixed OpenCL-RTL design of [29] by up to 2.73× with an average of 2.36× (geo.…”
Section: Comparison With Existing Fpga Workmentioning
confidence: 87%
“…We also like to acknowledge Dr. Blair P. Bremberg and Ms. Sumaiya Hussain Sadiq for their help in professional English editing of this manuscript. VOLUME 4, 2018 NeuFlow [143] Memory-Centric Accelerator [146] nn-X [148] Roofline-based FPGA Accelerator [55] Embedded FPGA Accelerator [98] DeepBurning [155] OpenCL-based FPGA Accelerator [80] Caffeine [153], [162] fpgaConvNet [165] Loop Unrolling [78], [168] Throughput-Optimized FPGA Accelerator [170] FP-DNN [171] FINN [181] Customized CONV Loop Accelerator [83] Latency-Driven Design for FPGA-based CNNs [183] DLA [188] Winograd-based CNN Accelerator [189] OpenCL-based Architecture for Accelerating CNNs [190] Multi-CLP Accelerator for CNNs [192] Automated Systolic Array Architecture for CNN [195] End-to-End Scalable FPGA Accelerator [196] DLAU [197] An Automatic RTL Compiler for High-Throughput Deep CNNs [199] Intel's DLA [200] Angel-Eye [60] Optimizing the CONV Operation to Accelerate DNNs on FPGA [204] Loop Unrolling El-Maleh's research interests are in the areas of synthesis, testing, and verification of digital systems. In addition, he has research interests in defect and soft-error tolerance design, VLSI design, design automation and efficient FPGA implementations of deep learning algorithms and data compression techniques.…”
Section: Acknowledgmentmentioning
confidence: 99%
“…In [15] a latency-driven design method is presented as an extension of the fpgaConvNet modeling framework [14]. This work models CNNs using the synchronous dataflow (SDF) model of computation.…”
Section: Related Workmentioning
confidence: 99%