“…In particular, Zhang et al. [81] employ a scalar quantization approach, converting non-zero output values represented as 32-bit floating-point numbers into lower-precision integer values. Among non-DNN-specific compression methods, the work of Zhang et al. [81] appears again: it supplements quantization with a classic compression method, run-length encoding [97], applied to intermediate results in an effort to reduce transmission costs even further. In this regard, it is important to note that, while the goal has been to reduce the size of the data exchanged between nodes, there has also been an evident concern for avoiding or, at the very least, mitigating information loss in the compression–decompression process. This is visible in [81], with the use of run-length encoding, and in [91], where Parthasarathy et al. apply the LZ4 algorithm [98] for the same purpose; both are lossless data compression methods.…”
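
To make the combination of scalar quantization and run-length encoding concrete, the following is a minimal Python sketch. It is not the implementation of [81]: the 8-bit width, the max-based scaling, the assumption of non-negative (e.g., post-ReLU) values, and all function names (`quantize`, `dequantize`, `rle_encode`, `rle_decode`) are illustrative assumptions.

```python
from typing import List, Tuple


def quantize(values: List[float], bits: int = 8) -> Tuple[List[int], float]:
    """Uniform scalar quantization to integers in [0, 2**bits - 1].

    Assumption (not from the source): inputs are non-negative, so zeros
    map to the integer 0 and the largest value maps to the top level.
    """
    if any(v < 0 for v in values):
        raise ValueError("this sketch assumes non-negative values")
    levels = (1 << bits) - 1
    peak = max(values, default=0.0)
    if peak == 0.0:
        return [0] * len(values), 1.0
    scale = peak / levels  # step size between adjacent integer levels
    return [round(v / scale) for v in values], scale


def dequantize(codes: List[int], scale: float) -> List[float]:
    """Approximate reconstruction; the quantization step itself is lossy."""
    return [c * scale for c in codes]


def rle_encode(codes: List[int]) -> List[Tuple[int, int]]:
    """Run-length encoding: collapse repeated values into (value, count) pairs."""
    runs: List[Tuple[int, int]] = []
    for c in codes:
        if runs and runs[-1][0] == c:
            runs[-1] = (c, runs[-1][1] + 1)
        else:
            runs.append((c, 1))
    return runs


def rle_decode(runs: List[Tuple[int, int]]) -> List[int]:
    """Exact inverse of rle_encode; this stage loses no information."""
    out: List[int] = []
    for value, count in runs:
        out.extend([value] * count)
    return out


if __name__ == "__main__":
    # Hypothetical sparse intermediate result with long runs of zeros.
    activations = [0.0, 0.0, 0.0, 1.5, 1.5, 0.0, 0.0, 3.0]
    codes, scale = quantize(activations)
    runs = rle_encode(codes)
    print(runs)
    print(dequantize(rle_decode(runs), scale))
```

In this sketch, only the quantization step discards information, with an error bounded by roughly half the step size `scale`. The run-length stage reproduces the integer codes exactly, which is the sense in which it is lossless, and it pays off when the intermediate results are sparse and contain long runs of zeros.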