2019 26th IEEE International Conference on Electronics, Circuits and Systems (ICECS) 2019
DOI: 10.1109/icecs46596.2019.8964993
CoopNet: Cooperative Convolutional Neural Network for Low-Power MCUs

Abstract: Fixed-point quantization and binarization are two reduction methods adopted to deploy Convolutional Neural Networks (CNNs) on end-nodes powered by low-power microcontroller units (MCUs). While most of the existing works use them as stand-alone optimizations, this work aims at demonstrating there is margin for a joint cooperation that leads to inferential engines with lower latency and higher accuracy. Called CoopNet, the proposed heterogeneous model is conceived, implemented and tested on off-the-shelf MCUs wi…
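
The abstract describes running a binarized network and a fixed-point network cooperatively. A minimal PyTorch-style sketch of one such cooperation scheme follows; the cascade rule, the threshold `tau`, and both model handles are illustrative assumptions, not the paper's actual CoopNet architecture:

```python
import torch
import torch.nn.functional as F

def cooperative_predict(binary_net, quantized_net, x, tau=0.9):
    """Cascade two reduced-precision CNNs (illustrative sketch).

    binary_net    -- fast, lower-accuracy binarized model (assumed handle)
    quantized_net -- slower, higher-accuracy fixed-point model (assumed handle)
    tau           -- confidence threshold for accepting the cheap answer
    """
    with torch.no_grad():
        probs = F.softmax(binary_net(x), dim=1)
        conf, pred = probs.max(dim=1)
        if conf.item() >= tau:
            # The cheap binarized prediction is confident enough: stop here.
            return pred.item()
        # Otherwise fall back to the more expensive fixed-point model.
        return quantized_net(x).argmax(dim=1).item()
```

The intuition behind such a cascade is that the cheap binary model resolves easy inputs quickly, lowering average latency, while the fixed-point model preserves accuracy on the hard inputs that reach it.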


Cited by 10 publications (12 citation statements). References 11 publications.
Citation statements: 0 supporting, 12 mentioning, 0 contrasting.
“…Furthermore, our hardware and software design provides a true real-time parallel inference scheme, which allows its users to exploit all resources for the entire time. Such customization of accelerators and their concurrent execution are big advantages of FPGAs, and this is fundamentally different from the sequential approaches that previous studies [18,19,20] took, which could make one network idle when the other network is running.…”
Section: Multiple Neural Network Architecture
confidence: 88%
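
The sequential-versus-concurrent distinction drawn in this citation statement can be made concrete with a small host-side sketch; the `net_a`/`net_b` callables are placeholders, and host threads merely stand in for the independent FPGA accelerators the citing authors describe:

```python
import threading

def sequential_inference(net_a, net_b, x):
    # One network sits idle while the other runs (CPU-style scheduling).
    out_a = net_a(x)
    out_b = net_b(x)
    return out_a, out_b

def concurrent_inference(net_a, net_b, x):
    # Both networks run at once, analogous to two independent
    # accelerators; threads are only a stand-in for that hardware.
    results = {}
    def run(name, net):
        results[name] = net(x)
    ta = threading.Thread(target=run, args=("a", net_a))
    tb = threading.Thread(target=run, args=("b", net_b))
    ta.start(); tb.start()
    ta.join(); tb.join()
    return results["a"], results["b"]
```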
“…However, in this study, both DSP and LUT utilizations are maximized in a flexible and efficient way by implementing two different networks with different main computation units. This is an aspect that CPU-based solutions [18] or even previous FPGA-based solutions [19,20] did not offer. Previous FPGA-based solutions either implemented only one of the networks on FPGAs [20], which resulted in LUT being a bottleneck, or did not use extremely low bit-width networks [19], which resulted in DSP being a bottleneck.…”
Section: Multiple Neural Network Architecture
confidence: 97%
“…Initially, elements of gradient-based learning were combined with other machine learning algorithms [12,19] to produce ultra-low memory classifiers (< 2KB). More recently, manually designed neural networks with quantisation and binarisation have been used for image classification on MCUs, too, though with a larger memory footprint [26,37]. Neural architecture search (NAS) is a widely explored topic in deep learning for GPUs.…”
Section: Related Work
confidence: 99%
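
For readers unfamiliar with the binarisation technique this statement refers to, the sketch below shows the common sign-based weight binarisation trained with a straight-through estimator; this is the textbook formulation, not necessarily the exact scheme used in the works cited as [26,37]:

```python
import torch

class BinarizeSTE(torch.autograd.Function):
    """Sign-binarize weights in the forward pass; pass gradients
    straight through (clipped to [-1, 1]) in the backward pass."""

    @staticmethod
    def forward(ctx, w):
        ctx.save_for_backward(w)
        return torch.sign(w)

    @staticmethod
    def backward(ctx, grad_out):
        (w,) = ctx.saved_tensors
        # Straight-through estimator: zero the gradient outside [-1, 1].
        return grad_out * (w.abs() <= 1).to(grad_out.dtype)

binarize = BinarizeSTE.apply  # usage: w_bin = binarize(layer.weight)
```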
“…There exist different works that proposed the use of ensemble methods for deep neural networks. Remarkable results are reported in [26], where the authors adopted a boosting strategy on image classification, but also in CoopNet [27] which combines multiple precision models to improve accuracy and inference latency. Even more interesting, the concept of ensemble learning can be found in the internal architecture of the most recent CNN models.…”
Section: Ensembles Learning
confidence: 99%
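
As a concrete illustration of combining multiple precision models, the following sketch fuses an ensemble by averaging softmax outputs; the averaging rule and the model list are assumptions for illustration, not CoopNet's published combination strategy:

```python
import torch
import torch.nn.functional as F

def ensemble_predict(models, x):
    """Average the softmax outputs of several mixed-precision models.

    models -- iterable of nn.Module instances at different bit-widths (assumed)
    x      -- input batch tensor
    """
    with torch.no_grad():
        probs = torch.stack([F.softmax(m(x), dim=1) for m in models])
        # Mean over the ensemble axis, then pick the most probable class.
        return probs.mean(dim=0).argmax(dim=1)
```

Averaging probabilities rather than hard votes lets a confident high-precision model outweigh an uncertain low-precision one, which is one common reason such ensembles improve accuracy.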