2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01193

When to Prune? A Policy towards Early Structural Pruning

Abstract: Data often arrives in sequence over time in real-world deep learning applications such as autonomous driving. When new training data is available, training the model from scratch undermines the benefit of leveraging the learned knowledge, leading to significant training costs. Warm-starting from a previously trained checkpoint is the most intuitive way to retain knowledge and advance learning. However, existing literature suggests that this warm-starting degrades generalization. In this paper, we advocate for wa…

Cited by 31 publications (8 citation statements)
References 23 publications
“…DNNShifter is primarily limited by the high computation cost of training sparse models. There is potential for structured pruning to be conducted at the initialisation of the model (before training) with minimal accuracy loss [49,50]. This will be explored in the future.…”
Section: Discussion
confidence: 99%
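The pruning-at-initialisation idea referenced above can be illustrated with a small sketch. The snippet below scores the output channels of a freshly initialised convolution by the L1 norm of their weights and keeps only the top fraction; the magnitude criterion and the keep ratio are illustrative assumptions, not the specific procedures of [49] or [50].

```python
import torch
import torch.nn as nn

def prune_conv_at_init(conv: nn.Conv2d, keep_ratio: float = 0.5) -> nn.Conv2d:
    """Structurally prune output channels of a freshly initialised conv layer.

    Channels are ranked by the L1 norm of their (still random) initial weights
    and only the top `keep_ratio` fraction is kept. This is a toy stand-in for
    pruning-at-initialisation criteria, not the method of the cited works.
    """
    n_keep = max(1, int(conv.out_channels * keep_ratio))
    # L1 norm of each output-channel filter: shape (out_channels,)
    scores = conv.weight.detach().abs().sum(dim=(1, 2, 3))
    keep_idx = torch.topk(scores, n_keep).indices.sort().values

    pruned = nn.Conv2d(conv.in_channels, n_keep, conv.kernel_size,
                       stride=conv.stride, padding=conv.padding,
                       bias=conv.bias is not None)
    with torch.no_grad():
        pruned.weight.copy_(conv.weight[keep_idx])
        if conv.bias is not None:
            pruned.bias.copy_(conv.bias[keep_idx])
    return pruned

# Example: shrink a 64-channel conv to 32 channels before any training happens.
layer = nn.Conv2d(3, 64, kernel_size=3, padding=1)
slim = prune_conv_at_init(layer, keep_ratio=0.5)
print(slim.weight.shape)  # torch.Size([32, 3, 3, 3])
```

Because the slimmer layer exists before the first gradient step, all subsequent training runs on the reduced architecture, which is where the training-cost saving comes from.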
“…Li et al [41] determined channel configuration using a random search. Shen et al [42] pruned channels globally based on magnitude and gradient criteria. Unlike pruning-only methods, Hou et al [43] proposed a pruning-and-regrowing method to avoid removing important channels.…”
Section: Related Work
confidence: 99%
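As a rough sketch of what a magnitude-and-gradient channel criterion can look like (the exact criteria of [42] are not reproduced here), one common form scores each convolution channel by a first-order |weight × gradient| saliency and then prunes the globally lowest-scoring channels across all layers:

```python
import torch
import torch.nn as nn

def global_channel_scores(model: nn.Module) -> dict:
    """Score every conv output channel by a |w * dL/dw| saliency.

    Assumes a backward pass has already populated `.grad`. This is an
    illustrative magnitude-and-gradient style criterion, not the exact
    formulation of the cited work.
    """
    scores = {}
    for name, module in model.named_modules():
        if isinstance(module, nn.Conv2d) and module.weight.grad is not None:
            # Sum the saliency over each output-channel filter.
            saliency = (module.weight * module.weight.grad).abs().sum(dim=(1, 2, 3))
            scores[name] = saliency.detach()
    return scores

def channels_to_prune(scores: dict, prune_fraction: float = 0.3) -> dict:
    """Pick the globally lowest-scoring channels across all scored layers."""
    all_scores = torch.cat(list(scores.values()))
    k = max(1, int(len(all_scores) * prune_fraction))
    threshold = torch.kthvalue(all_scores, k).values
    return {name: (s <= threshold).nonzero(as_tuple=True)[0]
            for name, s in scores.items()}
```

Ranking channels against a single global threshold, rather than per layer, is what lets thin layers keep more channels and wide layers give up more.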
“…Given N tokens of dimension d corresponding to the image patches, the self-attention that correlates every pair of the N tokens incurs O(N²d) complexity in a single update round. For deploying Transformers on edge devices, a variety of simplified models have been proposed to reduce parameters and operations, for example parameter pruning [12,28], low-rank factorization [38], and knowledge distillation [24,35]. Yet these acceleration strategies are limited in that they still rely on CNNs, which deviates from the original design of the Transformer, namely facilitating deep learning with a working mechanism other than the CNN.…”
Section: Related Work
confidence: 99%
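The quadratic term in that complexity comes from the N × N attention matrix relating every pair of tokens. A minimal sketch of plain scaled dot-product self-attention (single head, no learned projections, not any particular efficient variant) makes the cost explicit:

```python
import torch

def self_attention(x: torch.Tensor) -> torch.Tensor:
    """Plain scaled dot-product self-attention over N tokens of dimension d.

    The (N, N) score matrix is what gives the O(N^2 * d) cost that motivates
    pruning and distillation of Transformers for edge deployment.
    """
    n, d = x.shape
    q, k, v = x, x, x                          # single head, no projections, for clarity
    scores = q @ k.transpose(0, 1) / d ** 0.5  # (N, N): N^2 dot products of length d
    weights = torch.softmax(scores, dim=-1)    # (N, N) attention weights
    return weights @ v                         # (N, d): another O(N^2 * d) matmul

tokens = torch.randn(196, 64)  # e.g. 14x14 image patches with d = 64
out = self_attention(tokens)
print(out.shape)               # torch.Size([196, 64])
```

Doubling the number of patches quadruples the size of the score matrix, which is why token count, not just channel width, dominates Transformer inference cost on edge devices.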