Proceedings of the 55th Annual Design Automation Conference 2018
DOI: 10.1145/3195970.3196041
|View full text |Cite
|
Sign up to set email alerts
|

Efficient winograd-based convolution kernel implementation on edge devices

Abstract: The implementation of Convolutional Neural Networks on edge Internet of Things (IoT) devices is a significant programming challenge, due to the limited computational resources and the real-time requirements of modern applications. This work focuses on the efficient implementation of the Winograd convolution, based on a set of application-independent and Winograd-specific software techniques for improving the utilization of the edge devices computational resources. The proposed techniques were evaluated in Inte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 6 publications
0
10
0
Order By: Relevance
“…Winograd convolution. It is an efficient algorithm to reduce multipliers for CONV3×3 and recently shows advantages on GPU [34], FPGA [38], and embedded processor [58]. However, it increases 23.5% of area in our case because the overheads of long internal bitwidths and additional pre-/post-processing become significant for our 8-bit implementation.…”
Section: Related Workmentioning
confidence: 97%
“…Winograd convolution. It is an efficient algorithm to reduce multipliers for CONV3×3 and recently shows advantages on GPU [34], FPGA [38], and embedded processor [58]. However, it increases 23.5% of area in our case because the overheads of long internal bitwidths and additional pre-/post-processing become significant for our 8-bit implementation.…”
Section: Related Workmentioning
confidence: 97%
“…The main goal of the SDK4ED project is to minimize cost, development time, and complexity of lowenergy software development processes, by providing a set of innovative solutions (i.e., toolboxes) integrated into the form of an easy-to-use platform for automatic optimization and trade-off calculation among important design-time and run-time software quality attributes. More specifically, the outcome of the project so far showcases numerous research and technical achievements with respect to the optimization of the targeted quality attributes, namely Maintainability [3,5,16,37,50], Dependability [32,33,[51][52][53][54], Energy Consumption [23,24,42,67], as well as Quality Forecasting [60,61,63] and Decision Support [47,[55][56][57] throughout the overall software development cycle. The SDK4ED TD Forecasting tool, integrated into the TD Management (TDM) framework, aims to provide predictive forecasts regarding the evolution of the TD quality attribute.…”
Section: Rq : Can Data Clustering Improve the Accuracy Of Crossproject Td Forecasting?mentioning
confidence: 99%
“…[40], [86] utilized the SIMT architecture of GPGPU to process CNN using Winograd convolution in parallel, and [40] also added support for Zero-Skip. [87], [88] used high-efficiency Winograd convolution on IoT devices to achieve high performance. [41], [89], [90] used random calculation and approximate calculation to complete the implementation.…”
Section: Cpumentioning
confidence: 99%