2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018
DOI: 10.1109/cvprw.2018.00219
|View full text |Cite
|
Sign up to set email alerts
|

Ultra Power-Efficient CNN Domain Specific Accelerator with 9.3TOPS/Watt for Mobile and Embedded Applications

Abstract: Computer vision performances have been significantly improved in recent years by Convolutional Neural Networks (CNN). Currently, applications using CNN algorithms are deployed mainly on general purpose hardwares, such as CPUs, GPUs or FPGAs. However, power consumption, speed, accuracy, memory footprint, and die size should all be taken into consideration for mobile and embedded applications. Domain Specific Architecture (DSA) for CNN is the efficient and practical solution for CNN deployment and implementation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
18
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 23 publications
0
18
0
Order By: Relevance
“…This section will present how a reconfigurable approximate multiplier with two precisions can be implemented to allow for the utilization of the concept in single-core systems. This will expand the proposed concept to include the design of approximate multiplier-based CNN accelerators for embedded systems and low power applications such as [42][43]. Fig 13. demonstrates how a reconfigurable approximate multiplier with two precisions can be implemented inside each MAC unit.…”
Section: B Single-core Reconfigurable Architecturementioning
confidence: 99%
“…This section will present how a reconfigurable approximate multiplier with two precisions can be implemented to allow for the utilization of the concept in single-core systems. This will expand the proposed concept to include the design of approximate multiplier-based CNN accelerators for embedded systems and low power applications such as [42][43]. Fig 13. demonstrates how a reconfigurable approximate multiplier with two precisions can be implemented inside each MAC unit.…”
Section: B Single-core Reconfigurable Architecturementioning
confidence: 99%
“…Low power consumption and high inference speed are practical requirements when deploying semantic segmentation with constraints on power and computation resources. The recent years development and wide availability of lowpower CNN (Convolutional Neural Networks) accelerators [11,7] has made the semantic segmentation applications on mobile devices more feasible. There are some significant advantages of these CNN accelerator chips.…”
Section: Introductionmentioning
confidence: 99%
“…First, ultra-low power. The CNN accelerator chip in [11] has the low power consumption of only 300mW. The most recent released chip has the peak power of only 224mW.…”
Section: Introductionmentioning
confidence: 99%
“…It has many real-world applications to be deployed on edge devices. CNN (Convolutional Neural Networks) accelerators [19,15] are ideal for these applications by providing high inference speed and low power consumption. The most recent chip has the peak power of only 224mW [14].…”
Section: Introductionmentioning
confidence: 99%