2020
DOI: 10.1109/tcsii.2020.2983648
|View full text |Cite
|
Sign up to set email alerts
|

CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
79
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 102 publications
(80 citation statements)
references
References 12 publications
0
79
0
1
Order By: Relevance
“…Mixed precision can indeed provide a way to reduce the memory footprint of layers that do not need a high precision representation (using 8-bit for weights and activations), while keeping a higher precision (16-bit representation) for layers that need it. The CMix-NN [ 57 ] library already provides an implementation of convolution functions for various data types of configuration (in 2, 4 and 8 bits). To further improve power consumption and memory footprint, binary neural networks can also be considered.…”
Section: Discussionmentioning
confidence: 99%
“…Mixed precision can indeed provide a way to reduce the memory footprint of layers that do not need a high precision representation (using 8-bit for weights and activations), while keeping a higher precision (16-bit representation) for layers that need it. The CMix-NN [ 57 ] library already provides an implementation of convolution functions for various data types of configuration (in 2, 4 and 8 bits). To further improve power consumption and memory footprint, binary neural networks can also be considered.…”
Section: Discussionmentioning
confidence: 99%
“…Notice that the accumulator φ linear I NT (w), I NT (x) is still integer-valued, but requires in general more bits than its inputs (i.e., ε φ will be smaller than ε x and ε w ). quant normalizes φ with an affine transformation of parameters κ and λ, then collapses its values, "converting" it to a representation with less bits (i.e., with bigger ε y ) 1 :…”
Section: Background 21 Qnns and Mixed-precision Qnnsmentioning
confidence: 99%
“…We implemented real-world applications, such as smart-web-cam, face-door, and lights-after-dark, at first hand. Real-world applications are implemented under the assumption that the lightweight JavaScript engine and external libraries [5]- [10] run together. Smart-web-cam recognizes a keyword by periodically sampling the sound sensor and the sensor's value into a keyword-spotting model [6], and then reports the camera frame to a server when a keyword is recognized.…”
Section: Evaluation a Experimental Settingsmentioning
confidence: 99%
“…Second, to support abundant functionality, such as DNNbased machine learning [5]- [8] or IoT connectivity standards [9], [10], several external libraries have been used by recent low-end devices. As these libraries use large buffers, such as intermediate feature map buffers [5]- [8] or request message buffers [9], [10], they require a large amount of memory space. Therefore, to use these libraries within the limited memory capacity of the MCUs, the libraries introduce buffer management optimization techniques [7]- [10] or DNN model compression techniques [5]- [8].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation