2020
DOI: 10.48550/arxiv.2003.03603
Preprint

Generative Low-bitwidth Data Free Quantization

Abstract: Neural network quantization is an effective way to compress deep models, reduce their execution latency, and improve their energy efficiency, so that they can be deployed on mobile or embedded devices. Existing quantization methods require the original data for calibration or fine-tuning to obtain good performance. However, in many real-world scenarios the data may be unavailable due to confidentiality or privacy concerns, making existing quantization methods inapplicable. Moreover, due to the absence of original data, the rece…
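As background, the sketch below shows the kind of asymmetric uniform fake-quantizer that low-bitwidth methods of this family typically calibrate. This is our PyTorch illustration, not the paper's exact quantizer; the function name and defaults are ours.

```python
import torch

def uniform_quantize(x: torch.Tensor, num_bits: int = 4) -> torch.Tensor:
    """Illustrative asymmetric uniform fake-quantization (hypothetical helper).

    Maps float values onto a 2**num_bits-level integer grid and back; choosing
    the scale and zero point is what calibration data is normally needed for.
    """
    qmin, qmax = 0, 2 ** num_bits - 1
    x_min, x_max = x.min(), x.max()
    scale = (x_max - x_min).clamp(min=1e-8) / (qmax - qmin)
    zero_point = torch.round(-x_min / scale).clamp(qmin, qmax)
    q = torch.round(x / scale + zero_point).clamp(qmin, qmax)
    return (q - zero_point) * scale  # dequantized values on the integer grid
```

For example, `uniform_quantize(torch.randn(64, 64), num_bits=4)` maps a weight matrix onto a 16-level grid; data-free methods must choose such grids without access to the original training data.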

Cited by 4 publications (3 citation statements) | References 46 publications (46 reference statements)
“…Generative adversarial networks (GANs) [2,33] can generate images with high fidelity, but they still need real images as a reference while training the generator, so they are not fully data-free. Recently, Chen et al [5] and Xu et al [28] proposed to use a generator to synthesize images from a pre-trained model while simultaneously training the student network. Further, Yin et al [29] proposed to synthesize images from the pre-trained teacher network using regularization terms and a Jensen-Shannon divergence loss.…”
Section: Related Work
confidence: 99%
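The generator-plus-student scheme described in the quoted passage can be sketched as an alternating optimization. The step below is our hypothetical PyTorch illustration (the function name, the adversarial generator loss, and the hyperparameters are our assumptions; the cited works [5, 28, 29] add further loss terms):

```python
import torch
import torch.nn.functional as F

def datafree_distill_step(generator, teacher, student, g_opt, s_opt,
                          batch_size=64, z_dim=100, T=4.0):
    """One illustrative alternating step of generator-based data-free
    distillation. Assumes the teacher's parameters are frozen
    (requires_grad=False); all names here are ours, not the cited papers'.
    """
    z = torch.randn(batch_size, z_dim)

    # Generator step: synthesize inputs on which the student still
    # disagrees with the teacher (one common adversarial choice).
    fake = generator(z)
    g_loss = -F.l1_loss(student(fake), teacher(fake))
    g_opt.zero_grad()
    g_loss.backward()
    g_opt.step()

    # Student step: match the teacher's softened outputs on the same
    # synthetic batch (standard KL-based knowledge distillation).
    fake = generator(z).detach()
    with torch.no_grad():
        t_logits = teacher(fake)
    s_logits = student(fake)
    kd = F.kl_div(F.log_softmax(s_logits / T, dim=1),
                  F.softmax(t_logits / T, dim=1),
                  reduction="batchmean") * T * T
    s_opt.zero_grad()
    kd.backward()
    s_opt.step()
    return g_loss.item(), kd.item()
```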
“…DFC and ZeroQ proposed to synthesize calibration data by gradient descent under the supervision of BatchNorm statistics [10,3]. Generative adversarial networks (GANs) have also been utilized to generate synthetic calibration data [37], where the generator produces the calibration data and the quantized model serves as the discriminator. One problem with these optimization-based methods is their high time complexity.…”
Section: Post-training Quantization
confidence: 99%
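The BatchNorm-supervised synthesis attributed to DFC and ZeroQ can be sketched as gradient descent on the inputs themselves. The code below is our illustrative PyTorch reconstruction (the hook bookkeeping, step count, and unweighted loss are assumptions, not the cited papers' exact recipes); its inner optimization loop also makes the quoted point about high time complexity concrete:

```python
import torch
import torch.nn as nn

def synthesize_from_bn(model: nn.Module, steps=500, batch=32,
                       shape=(3, 32, 32), lr=0.1):
    """Optimize random inputs so their per-layer activation statistics
    match the running mean/variance stored in the model's BatchNorm
    layers (ZeroQ/DFC-style; names and defaults are our assumptions).
    """
    stats = []  # (batch_mean, batch_var, running_mean, running_var)

    def hook(m, inp, out):
        x = inp[0]
        stats.append((x.mean(dim=(0, 2, 3)),
                      x.var(dim=(0, 2, 3), unbiased=False),
                      m.running_mean, m.running_var))

    handles = [m.register_forward_hook(hook)
               for m in model.modules() if isinstance(m, nn.BatchNorm2d)]
    x = torch.randn(batch, *shape, requires_grad=True)
    opt = torch.optim.Adam([x], lr=lr)
    model.eval()  # BN uses (and does not update) its running statistics
    for _ in range(steps):
        stats.clear()
        model(x)
        loss = sum(torch.norm(mu - rm) + torch.norm(var - rv)
                   for mu, var, rm, rv in stats)
        opt.zero_grad()
        loss.backward()
        opt.step()
    for h in handles:
        h.remove()
    return x.detach()
```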
“…ZeroQ showed its effectiveness on various datasets including ImageNet [7] and MSCOCO [23]. Another method is to use generative adversarial networks (GANs) to create fake calibration data [37]. The proposed framework, named GDFQ, is trained with a BatchNorm statistics loss, a cross-entropy classification loss, and a knowledge distillation loss.…”
Section: Introduction
confidence: 99%
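A hedged sketch of how the three loss terms named in the quoted passage could be combined for GDFQ-style training; the weighting scheme, temperature, and function signature are our assumptions rather than the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def gdfq_style_loss(labels, teacher_logits, student_logits, bn_stats,
                    T=4.0, alpha=1.0, beta=1.0):
    """Combine the three losses the quoted passage attributes to GDFQ.

    bn_stats is a list of (batch_mean, batch_var, running_mean, running_var)
    tuples collected from the full-precision model's BatchNorm layers;
    alpha and beta are our illustrative weights.
    """
    # 1) BatchNorm statistics loss: synthetic activations should match the
    #    statistics memorized by the pre-trained model.
    l_bns = sum(F.mse_loss(mu, rm) + F.mse_loss(var, rv)
                for mu, var, rm, rv in bn_stats)
    # 2) Cross-entropy: the teacher should classify each synthetic image
    #    as the label the generator was conditioned on.
    l_ce = F.cross_entropy(teacher_logits, labels)
    # 3) Knowledge distillation: the quantized student mimics the teacher.
    l_kd = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                    F.softmax(teacher_logits / T, dim=1),
                    reduction="batchmean") * T * T
    return l_bns + alpha * l_ce + beta * l_kd
```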