2022 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS) 2022
DOI: 10.1109/coolchips54332.2022.9772668
|View full text |Cite
|
Sign up to set email alerts
|

A 1036 TOp/s/W, 12.2 mW, 2.72 μJ/Inference All Digital TNN Accelerator in 22 nm FDX Technology for TinyML Applications

Abstract: Tiny Machine Learning (TinyML) applications impose µJ/Inference constraints, with maximum power consumption of a few tens of mW. It is extremely challenging to meet these requirement at a reasonable accuracy level. In this work, we address this challenge with a flexible, fully digital Ternary Neural Network (TNN) accelerator in a RISC-V-based SoC. The design achieves 2.72 µJ/Inference, 12.2 mW, 3200 Inferences/sec at 0.5 V for a non-trivial 9-layer, 96 channels-per-layer network with CIFAR-10 accuracy of 86 %.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 5 publications
0
1
0
Order By: Relevance
“…These trade-offs stem from finding the balance between two key considerations: the memory and compute limitations of the target platform and selecting a trained model that achieves a competitive accuracy within acceptable latency [9]. Recently, dedicated Application Specific Integrated Circuits (ASIC) have been designed and validated for ultra-low power edge inference using NN variants [10], [11]. They present custom hardware capable of operating at near-threshold voltages with powergating to reduce active chip power during operations.…”
Section: Introductionmentioning
confidence: 99%
“…These trade-offs stem from finding the balance between two key considerations: the memory and compute limitations of the target platform and selecting a trained model that achieves a competitive accuracy within acceptable latency [9]. Recently, dedicated Application Specific Integrated Circuits (ASIC) have been designed and validated for ultra-low power edge inference using NN variants [10], [11]. They present custom hardware capable of operating at near-threshold voltages with powergating to reduce active chip power during operations.…”
Section: Introductionmentioning
confidence: 99%