2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023
DOI: 10.1109/cvpr52729.2023.01554
|View full text |Cite
|
Sign up to set email alerts
|

Bit-shrinking: Limiting Instantaneous Sharpness for Improving Post-training Quantization

Chen Lin,
Bo Peng,
Zheyang Li
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…It can be divided into posttraining quantization (PTQ) and quantization-aware training (QAT). PTQ compresses a pretrained full-precision model into a low-bit model without retraining or fine-tuning [33][34][35]. However, PTQ often suffers from significant accuracy degradation due to the lack of weightaware training.…”
Section: Model Quantizationmentioning
confidence: 99%
“…It can be divided into posttraining quantization (PTQ) and quantization-aware training (QAT). PTQ compresses a pretrained full-precision model into a low-bit model without retraining or fine-tuning [33][34][35]. However, PTQ often suffers from significant accuracy degradation due to the lack of weightaware training.…”
Section: Model Quantizationmentioning
confidence: 99%