2023
DOI: 10.3390/electronics12071571
|View full text |Cite
|
Sign up to set email alerts
|

A High-Performance FPGA-Based Depthwise Separable Convolution Accelerator

Abstract: Depthwise separable convolution (DSC) significantly reduces parameter and floating operations with an acceptable loss of accuracy and has been widely used in various lightweight convolutional neural network (CNN) models. In practical applications, however, DSC accelerators based on graphics processing units (GPUs) cannot fully exploit the performance of DSC and are unsuitable for mobile application scenarios. Moreover, low resource utilization due to idle engines is a common problem in DSC accelerator design. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 28 publications
0
0
0
Order By: Relevance
“…Similarly, Jang J G et al [34] found that depthwise separable convolutions, especially when combined with Falcon convolution, can decrease computational demands while preserving accuracy. Furthermore, Huang J et al [35] developed an FPGAbased DSC accelerator, enhancing on-chip and off-chip efficiency. These studies underline the value of depthwise separable convolutions in improving computational efficiency and model performance.…”
Section: Utilizing Convolutional Neural Network For Autofocusingmentioning
confidence: 99%
“…Similarly, Jang J G et al [34] found that depthwise separable convolutions, especially when combined with Falcon convolution, can decrease computational demands while preserving accuracy. Furthermore, Huang J et al [35] developed an FPGAbased DSC accelerator, enhancing on-chip and off-chip efficiency. These studies underline the value of depthwise separable convolutions in improving computational efficiency and model performance.…”
Section: Utilizing Convolutional Neural Network For Autofocusingmentioning
confidence: 99%
“…Among these options, FPGAs have gained popularity for implementing CNNs in embedded systems primarily due to their ability to perform convolution operations in parallel with high energy efficiency. Compared to GPUs, FPGAs offer higher energy efficiency, making them an attractive choice for resource-constrained embedded systems [4][5][6][7].…”
Section: Introductionmentioning
confidence: 99%