2022
DOI: 10.48550/arxiv.2203.05025
Preprint

Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks

Abstract: Deploying Deep Neural Networks in low-power embedded devices for real-time constrained applications requires optimization of the memory and computational complexity of the networks, usually by quantizing the weights. Most existing works employ linear quantization, which causes considerable degradation in accuracy for weight bit widths lower than 8. Since the distribution of weights is usually non-uniform (with most weights concentrated around zero), other methods, such as logarithmic quantization, are more s…

Cited by 4 publications (11 citation statements) · References 5 publications
“…The PoT quantization is a logarithmic quantizer [87] designed to approximate the weights to the closest power of two in the range defined by the considered number of bits. Mathematically, we can represent the PoT quantization considering $2^{BW}$ elements as [84], [87], [88]:…”
Section: Quantization
Mentioning, confidence: 99%
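As an illustration of the scheme described in this quote, here is a minimal NumPy sketch of nearest-power-of-two weight quantization. The exponent range (one sign bit plus bw - 1 exponent bits, giving $2^{BW}$ signed levels) and the absence of an explicit zero level are simplifying assumptions, not the exact formulation of the cited papers.

```python
import numpy as np

def pot_quantize(w: np.ndarray, bw: int = 4) -> np.ndarray:
    """Snap each weight to the nearest power of two (PoT quantization).

    Assumed layout: 1 sign bit and bw - 1 bits indexing the exponents
    {0, -1, ..., -(2**(bw - 1) - 1)}, i.e. 2**bw signed levels in total.
    """
    n_exp = 2 ** (bw - 1)  # number of representable exponents
    sign = np.sign(w)
    # Round log2(|w|) to the nearest integer exponent and clip it to the
    # range covered by the available exponent bits.
    exp = np.round(np.log2(np.abs(w) + np.finfo(w.dtype).tiny))
    exp = np.clip(exp, -(n_exp - 1), 0)
    return sign * np.exp2(exp)

w = np.array([0.7, -0.2, 0.05, -0.9], dtype=np.float32)
print(pot_quantize(w))  # [ 0.5   -0.25   0.0625 -1.   ]
```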
“…The latter is done by assuming that $\frac{\partial W_{qc}}{\partial W} = 1$. As stated in [88], this process is known as a Straight-Through Estimator, and it results in a smoother transition between consecutive quantization levels in the learning process.…”
Section: Quantization
Mentioning, confidence: 99%
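The Straight-Through Estimator in this quote can be sketched in PyTorch as a custom autograd Function: the forward pass applies the non-differentiable rounding, while the backward pass assumes the quantizer has derivative 1 and passes the gradient through unchanged. This is a generic STE sketch, not the exact training code of the cited work.

```python
import torch

class PoTQuantSTE(torch.autograd.Function):
    """Power-of-two quantizer trained with a Straight-Through Estimator."""

    @staticmethod
    def forward(ctx, w):
        # Non-differentiable step: snap each weight to the nearest power of two.
        sign = torch.sign(w)
        exp = torch.round(torch.log2(w.abs().clamp_min(1e-12)))
        return sign * torch.exp2(exp)

    @staticmethod
    def backward(ctx, grad_output):
        # STE: treat the quantizer as the identity, i.e. dW_qc/dW = 1,
        # so the incoming gradient flows through unchanged.
        return grad_output

w = torch.randn(8, requires_grad=True)
w_q = PoTQuantSTE.apply(w)   # quantized weights used in the forward pass
w_q.sum().backward()         # gradients reach w as if no rounding occurred
print(w.grad)                # all ones
```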
“…In the case of Power-of-Two (PoT) quantization, we have X = 0, because each multiplication costs just a shift [42], [53]. Lastly, for the Additive Powers-of-Two (APoT) quantization, we have X = n, where n denotes the number of additive terms.…”
Section: A Dense Layer
Mentioning, confidence: 99%
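The "each multiplication costs just a shift" point can be made concrete with fixed-point integers: multiplying an activation by a PoT weight $2^e$ reduces to a single bit shift, so no multiplier circuit (and, in the quoted cost model, X = 0 extra adders) is needed. A toy sketch:

```python
def mul_by_pot(x_fixed: int, e: int) -> int:
    """Multiply a fixed-point activation by a PoT weight 2**e via a shift.

    Fractional PoT levels (e < 0) become right shifts; e >= 0 becomes a
    left shift. No multiplier circuit is required either way.
    """
    return x_fixed << e if e >= 0 else x_fixed >> -e

# Activation 52 times weight 2**-2 = 0.25 -> 13, with a single shift.
assert mul_by_pot(52, -2) == 13
```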
“…Since APoT is a quantization scheme represented by a sum of PoT terms, APoT provides a smooth transition between PoT and uniform quantization. In various works, PoT was claimed to have very low complexity because the multiplications are replaced by just shifts [53], [69], [70]. However, when we consider that the multiplication in the uniform quantization can be represented by shifts and adders, and we have a fair metric like NABS to compare between different quantization techniques, the NABS when applying PoT is only around an order of magnitude lower than the NABS when using the uniform quantization.…”
Section: Comparative Analysis of the Complexities for Each NN Structure
Mentioning, confidence: 99%
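To see why APoT interpolates between PoT and uniform quantization, one can enumerate the level sets: each APoT level is a sum of n PoT terms, so a multiplication decomposes into n shifts plus additions (the X = n of the cost model quoted earlier), and larger n yields a denser, more uniform grid. The enumeration below is a simplification with an arbitrarily chosen exponent set; published APoT schemes constrain which exponents each term may draw from.

```python
from itertools import product

def apot_levels(n_terms: int = 2, exps=(0, -1, -2, -3)):
    """Enumerate unsigned APoT levels as sums of n_terms powers of two.

    With n_terms = 1 this degenerates to plain PoT; as n_terms grows,
    the level set fills in and approaches a uniform grid.
    """
    return sorted({sum(2.0 ** e for e in combo)
                   for combo in product(exps, repeat=n_terms)})

print(apot_levels(1))  # PoT:  [0.125, 0.25, 0.5, 1.0]
print(apot_levels(2))  # APoT: denser set, e.g. 0.375 = 2**-2 + 2**-3
```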