49th International Conference on Parallel Processing - ICPP 2020
DOI: 10.1145/3404397.3404407
|View full text |Cite
|
Sign up to set email alerts
|

Extremely Low-bit Convolution Optimization for Quantized Neural Network on Modern Computer Architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
2

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…Layer-wise Reconstruction: Assume the Hessian matrix is layer-diagonal and optimize the layer output one-by-one. It does not consider cross-layer dependency and resemble existing methods (Nagel et al, 2020;Hubara et al, 2020;Wang et al, 2020). 2.…”
Section: Block Reconstructionmentioning
confidence: 99%
See 1 more Smart Citation
“…Layer-wise Reconstruction: Assume the Hessian matrix is layer-diagonal and optimize the layer output one-by-one. It does not consider cross-layer dependency and resemble existing methods (Nagel et al, 2020;Hubara et al, 2020;Wang et al, 2020). 2.…”
Section: Block Reconstructionmentioning
confidence: 99%
“…For the acquisition of mobile ARM CPU latency, we adopt the redesigned low-bit GEMM implementation in Han et al (2020). Fig.…”
Section: B43 Latency Acquisitionmentioning
confidence: 99%
“…[45] embed linear quantization directly into the Winograd domain to achieve low-precision quantization. [46] further explored the use of the Winograd algorithm to optimize the convolution kernel with 4-6bit precision. [47] applied quantization on feature map slices, and applied particle swarm optimization technology to find the threshold of quantization.…”
Section: Low Precision and Quantizationmentioning
confidence: 99%
“…Then, it is difficult to run large-scale DNNs on mobile devices. In order to use DNNs on mobile devices, a number of methods have been developed [4,8].…”
Section: Introductionmentioning
confidence: 99%