2021
DOI: 10.3390/e23080933
Design of a 2-Bit Neural Network Quantizer for Laplacian Source

Abstract: Achieving real-time inference is one of the major issues in contemporary neural network applications, as complex algorithms are frequently being deployed to mobile devices that have constrained storage and computing power. Moving from a full-precision neural network model to a lower-precision representation by applying quantization techniques is a popular approach to mitigate this issue. Here, we analyze in detail and design a 2-bit uniform quantization model for the Laplacian source due to its significance in terms of im…
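As a rough numerical illustration of the setting the abstract describes (not the paper's actual closed-form design), the sketch below grid-searches the MSE-minimizing step size of a symmetric 2-bit uniform quantizer for a unit-variance Laplacian source; the sample size, seed, and search grid are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
# Unit-variance Laplacian source (scale b = 1/sqrt(2) gives variance 2*b^2 = 1).
x = rng.laplace(scale=1.0 / np.sqrt(2), size=200_000)

def uq_2bit(x, delta):
    """Symmetric mid-rise 2-bit uniform quantizer.

    Four cells of width delta; reproduction levels at +/-delta/2 and
    +/-3*delta/2, with overload clipping beyond +/-2*delta.
    """
    idx = np.clip(np.floor(x / delta), -2, 1)
    return (idx + 0.5) * delta

# Numerical grid search for the distortion-minimizing step size
# (an illustrative stand-in for the paper's analytical optimization).
deltas = np.linspace(0.6, 1.8, 241)
mses = [np.mean((x - uq_2bit(x, d)) ** 2) for d in deltas]
best_delta = deltas[int(np.argmin(mses))]
sqnr_db = 10.0 * np.log10(np.var(x) / min(mses))
print(f"best step ~ {best_delta:.3f}, SQNR ~ {sqnr_db:.2f} dB")
```

For a unit-variance Laplacian, classical quantization tables put the optimal 2-bit uniform step near 1.1 with an SQNR of roughly 7 dB, which the simulation should approximately reproduce.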

Cited by 10 publications (29 citation statements) · References 30 publications
“…Namely, NN normalization methods have already been confirmed to work well empirically [37], and many of them implicitly assume that the distributions of normalized NN parameters, primarily weights, initially have zero mean and unit variance. Keeping in mind that the weights’ distribution can closely fit some well-known probability density functions, such as the Laplacian probability density function (pdf) [2], in this paper, as in [12, 13, 34, 38, 39, 40], we assume a Laplacian-like distribution for the experimental weights’ distribution and the Laplacian pdf for the theoretical distribution of weights in order to estimate the performance of the three-bit uniform quantizer (three-bit UQ) in question. Our motivation to address the simplest UQ also stems from the fact that UQs are not optimal for nonlinear distributions.…”
Section: Related Work and Motivation
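The assumption in the quoted passage, normalized zero-mean, unit-variance, Laplacian-like weights fed to a three-bit UQ, can be mimicked numerically. The synthetic "weights", seed, and step size below are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic Laplacian-like "weights" (a stand-in for trained NN weights).
w = rng.laplace(scale=0.05, size=100_000)

# Normalize to zero mean and unit variance, as the quoted passage notes
# many NN normalization methods implicitly assume.
w_norm = (w - w.mean()) / w.std()

def uniform_quantize(x, bits, delta):
    # Symmetric mid-rise UQ with 2**bits levels of cell width delta.
    half = 2 ** (bits - 1)
    idx = np.clip(np.floor(x / delta), -half, half - 1)
    return (idx + 0.5) * delta

# Step size set near the classical optimum for a unit-variance Laplacian.
w_q = uniform_quantize(w_norm, bits=3, delta=0.73)
sqnr_db = 10.0 * np.log10(np.mean(w_norm**2) / np.mean((w_norm - w_q) ** 2))
print(f"three-bit UQ SQNR ~ {sqnr_db:.2f} dB")
```

An SQNR in the vicinity of 11 dB is the classical figure for a three-bit uniform quantizer on a unit-variance Laplacian source, so the simulation serves as a sanity check of the theoretical assumption rather than a result from the paper.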
“…For symmetrical data distributions, symmetrical quantizers with an even number of quantization steps are preferable [14, 24, 25, 26, 27, 28, 29, 32, 34, 35, 38, 40]. Although most real data can be expected to be asymmetrical, one cannot simply conjecture that the preferable quantizer is therefore also asymmetrical.…”
Section: Design of Symmetric Three-Bit Uniform Quantizer for the Purpose of NN Weights Compression
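The preference stated above, a symmetric quantizer with an even number of levels for a symmetric source, corresponds to a mid-rise layout in which the reproduction levels mirror each other about zero and no level sits at zero itself. A minimal sketch (the helper name is ours):

```python
import numpy as np

def midrise_levels(bits, delta):
    # Symmetric mid-rise reproduction levels: an even count (2**bits),
    # mirrored about zero, with no level placed at zero itself.
    half = 2 ** (bits - 1)
    return (np.arange(-half, half) + 0.5) * delta

print(midrise_levels(2, 1.0))  # four levels: [-1.5 -0.5  0.5  1.5]
```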
“…It is well known that a nonuniform quantizer model, well accommodated to the signal’s amplitude dynamics and a nonuniform pdf, has a lower quantization error compared to the uniform quantizer (UQ) model with an equal number of quantization levels or an equal bit-rate [2, 11, 13, 18, 20–27]. However, due to the fact that the UQ is the simplest quantizer model, it has been intensively studied, for instance in [23, 24, 28–32].…”
Section: Introduction
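The claim above, that a nonuniform quantizer matched to a nonuniform pdf beats a uniform one at the same number of levels, can be checked numerically. The sketch below uses a plain Lloyd iteration as the nonuniform design, an illustrative stand-in rather than any specific method from the cited works.

```python
import numpy as np

rng = np.random.default_rng(2)
# Unit-variance Laplacian source.
x = rng.laplace(scale=1.0 / np.sqrt(2), size=100_000)

def uq(x, n_levels, delta):
    # Symmetric mid-rise uniform quantizer with n_levels cells of width delta.
    half = n_levels // 2
    idx = np.clip(np.floor(x / delta), -half, half - 1)
    return (idx + 0.5) * delta

def lloyd(x, n_levels, iters=50):
    # Plain Lloyd iteration: nearest-level partition, then
    # conditional-mean update of each reproduction level.
    levels = np.quantile(x, (np.arange(n_levels) + 0.5) / n_levels)
    for _ in range(iters):
        edges = (levels[:-1] + levels[1:]) / 2
        cell = np.searchsorted(edges, x)
        levels = np.array([x[cell == k].mean() for k in range(n_levels)])
    return levels[np.searchsorted((levels[:-1] + levels[1:]) / 2, x)]

# Best uniform step via grid search vs. the Lloyd design, both with 8 levels.
mse_uniform = min(np.mean((x - uq(x, 8, d)) ** 2)
                  for d in np.linspace(0.5, 1.1, 121))
mse_lloyd = np.mean((x - lloyd(x, 8)) ** 2)
print(f"uniform MSE ~ {mse_uniform:.4f}, Lloyd MSE ~ {mse_lloyd:.4f}")
```

At equal bit-rate (3 bits, 8 levels) the nonuniform Lloyd design should come out with a visibly lower MSE than the best uniform quantizer, matching the quoted statement.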