2020
DOI: 10.1609/aaai.v34i07.6910

Pruning from Scratch

Abstract: Network pruning is an important research field aiming at reducing computational costs of neural networks. Conventional approaches follow a fixed paradigm which first trains a large and redundant network, and then determines which units (e.g., channels) are less important and thus can be removed. In this work, we find that pre-training an over-parameterized model is not necessary for obtaining the target pruned structure. In fact, a fully-trained over-parameterized model will reduce the search space for the pru…
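The abstract contrasts the conventional train-then-prune pipeline with searching the pruned structure directly from random weights. Below is a minimal, illustrative sketch (PyTorch; the layer sizes, L1 penalty weight, and gate threshold are assumptions, not the authors' published code) of one way such a search can look: keep the convolution weights at their random initialization and optimize only per-channel gates under a sparsity penalty, then read the surviving channel counts off the gates.

```python
# Hedged sketch: searching a pruned channel structure from randomly initialized
# weights by training only per-channel gates with an L1 sparsity penalty.
# All sizes and hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn

class GatedConv(nn.Module):
    """Conv block whose output channels are scaled by learnable gates."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, 3, padding=1, bias=False)
        self.bn = nn.BatchNorm2d(out_ch)
        self.gate = nn.Parameter(torch.ones(out_ch))  # one gate per channel

    def forward(self, x):
        x = torch.relu(self.bn(self.conv(x)))
        return x * self.gate.view(1, -1, 1, 1)

model = nn.Sequential(GatedConv(3, 32), GatedConv(32, 64),
                      nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 10))

# Optimize only the gates; conv weights stay at their random initialization.
gates = [p for n, p in model.named_parameters() if n.endswith("gate")]
opt = torch.optim.Adam(gates, lr=0.01)
criterion = nn.CrossEntropyLoss()

x, y = torch.randn(8, 3, 32, 32), torch.randint(0, 10, (8,))  # toy batch
for _ in range(10):
    opt.zero_grad()
    loss = criterion(model(x), y) + 1e-2 * sum(g.abs().sum() for g in gates)
    loss.backward()
    opt.step()

# Channels whose gates shrink toward zero are candidates for removal.
for name, p in model.named_parameters():
    if name.endswith("gate"):
        keep = (p.abs() > 0.1).sum().item()
        print(f"{name}: keep {keep}/{p.numel()} channels")
```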

Cited by 133 publications (71 citation statements)
References 13 publications
“…(3) Unlike [7], only one-shot pruning is adopted by FL-PQSU, as further pruning during federated training incurs additional overhead but contributes little to performance improvement [13]. According to our testing experiments, ℓ1-norm based pruning outperforms blind model shrinking by about 1% in accuracy for large models like VGG16, which cannot simply be neglected [16].…”
Section: Structured Pruning
confidence: 99%
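The statement above compares ℓ1-norm based pruning with blind (uniform) model shrinking. As a point of reference, here is a hedged sketch of ℓ1-norm filter ranking for a single convolution layer; the layer shape and the 50% pruning ratio are illustrative, not taken from the citing paper.

```python
# Hedged sketch of L1-norm filter ranking for structured pruning:
# filters with the smallest sum of absolute weights are pruned first.
import torch
import torch.nn as nn

conv = nn.Conv2d(64, 128, kernel_size=3)             # weight shape: (128, 64, 3, 3)
l1 = conv.weight.detach().abs().sum(dim=(1, 2, 3))   # one score per output filter

prune_ratio = 0.5
n_keep = int(conv.weight.shape[0] * (1 - prune_ratio))
keep_idx = torch.argsort(l1, descending=True)[:n_keep]

# A "blind" shrink would instead keep the first n_keep filters regardless of weights.
pruned_weight = conv.weight.detach()[keep_idx]        # shape: (64, 64, 3, 3)
print(pruned_weight.shape)
```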
“…Unlike these previous approaches, Play and Prune [15] allows specifying an error tolerance limit instead of a pruning ratio for each layer. Wang et al. [16] verify that pruning directly from randomly initialized weights can yield more diverse pruned structures with competitive performance. More recent studies [7], [17] propose pruning the model during training, not only to improve inference performance but also to reduce training costs.…”
Section: B. DNN Model Compression
confidence: 99%
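For the "prune during training" approaches mentioned above, a small illustrative sketch follows, using PyTorch's built-in pruning utilities with a made-up schedule; the cited methods' actual criteria and schedules are not reproduced here.

```python
# Hedged sketch: prune while training rather than after it, by raising global
# sparsity every few steps. The model, data, and schedule are toy assumptions.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()
params_to_prune = [(m, "weight") for m in model if isinstance(m, nn.Linear)]

x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))  # toy batch
for step in range(1, 31):
    opt.zero_grad()
    criterion(model(x), y).backward()
    opt.step()
    if step % 10 == 0:  # gradually raise sparsity as training proceeds
        prune.global_unstructured(params_to_prune,
                                  pruning_method=prune.L1Unstructured,
                                  amount=0.2)  # prune 20% of remaining weights

total = sum(m.weight.numel() for m, _ in params_to_prune)
zeros = sum((m.weight == 0).sum().item() for m, _ in params_to_prune)
print(f"sparsity: {zeros / total:.1%}")
```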
“…The implementation of our proposed compression technique 1 is assessed in terms of accuracy vs. parameter count against state-of-the-art pruning approaches including [1,2,4,7,8,10,15,21,22,23]. We present strong results with our novel CNN compression applied to the ResNet-50 [12] and MobileNet-V2 [13] architectures for the image classification task on the ImageNet ILSVRC dataset [24] (Sec.…”
Section: Fig. 1 Tensor Reordering and DCT Compression
confidence: 99%
“…Currently, there exist two main approaches to building lightweight networks. (1) The first approach uses network pruning techniques [9] or knowledge distillation [10] to achieve model compression and inference acceleration by removing redundant structures and parameters. Because accuracy-focused models contain strategies that help overcome various problems encountered during training, such as overfitting, it is difficult to scale such a model down sufficiently without sacrificing accuracy.…”
Section: Introduction
confidence: 99%
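Since this statement names knowledge distillation alongside pruning as the other main route to lightweight networks, here is a sketch of the standard distillation loss; the temperature and weighting are illustrative, and this is a textbook formulation rather than code from the cited survey.

```python
# Hedged sketch of knowledge distillation: the student matches the teacher's
# softened output distribution in addition to the hard labels.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    """Weighted sum of a soft KL term (teacher guidance) and a hard CE term."""
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=1),
                    F.softmax(teacher_logits / T, dim=1),
                    reduction="batchmean") * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Toy usage with random logits standing in for real teacher/student networks.
student = torch.randn(16, 10, requires_grad=True)
teacher = torch.randn(16, 10)
labels = torch.randint(0, 10, (16,))
print(distillation_loss(student, teacher, labels).item())
```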