Fast Convex Pruning of Deep Neural Networks

Aghasi, Alireza; Abdi, Afshin; Romberg, Justin

doi:10.1137/19m1246468

Cited by 37 publications

(31 citation statements)

References 28 publications

“…S. Ye et al [19] propose a progressive weight pruning approach and demonstrate high pruning rate by using partial pruning with moderate pruning rates. Aghasi et al [20] develop a convex post-processing technique that prunes a trained network layer by layer while preserving the internal responses.…”

Section: Background and Related Work (mentioning)

confidence: 99%

Hardware-Based Real-Time Deep Neural Network Lossless Weights Compression

Malach

Greenberg

Haiut

2020

IEEE Access

“…In this section, we first introduce the topology of the feedforward neural network models, then we explain the pruning method [37] that has been used to generate the compressed deep learning models. After that, we illustrate our approach to synthesis the compressed models, and the selection mechanism used to filter the best ones.…”

Section: Ensyth: Synthesis Of Deep Learning Ensembles (mentioning)

confidence: 99%

“…The accuracy of LeNet-5 baseline model on CIFAR-10 is 78.4%, CIFAR-5 is 73.3% and 90.3% on MNIST-FASHION. After that, we prune and fine tune the baseline models with Net-Trim [37]. Net-trim's has four hyperparameters: L1: apply L1 regularisation on model's weight; L2: apply L2 regularisation on model's weight; dropout: a factor used to ignore neurons during a training process randomly; Epsilon gain: has a direct effect on the accuracy as well the sparsity of the pruned model.…”

Section: Network Training and Pruning (mentioning)

confidence: 99%

EnSyth: A Pruning Approach to Synthesis of Deep Learning Ensembles

Alhalabi

Gaber

Basurra

2019

2019 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Deep neural networks have achieved state-of-art performance in many domains including computer vision, natural language processing and self-driving cars. However, they are very computationally expensive and memory intensive which raises significant challenges when it comes to deploy or train them on strict latency applications or resource-limited environments. As a result, many attempts have been introduced to accelerate and compress deep learning models, however the majority were not able to maintain the same accuracy of the baseline models. In this paper, we describe EnSyth, a deep learning ensemble approach to enhance the predictability of compact neural network's models. First, we generate a set of diverse compressed deep learning models using different hyperparameters for a pruning method, after that we utilise ensemble learning to synthesise the outputs of the compressed models to compose a new pool of classifiers. Finally, we apply backward elimination on the generated pool to explore the best performing combinations of models. On CIFAR-10, CIFAR-5 data-sets with LeNet-5, EnSyth outperforms the predictability of the baseline model.
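The backward-elimination step described in this abstract can be illustrated with a minimal sketch. It assumes each compressed model is represented only by its predicted labels on a validation set, that outputs are combined by plurality vote, and that elimination is greedy. The function names and the vote rule are hypothetical choices made for illustration, not details taken from the EnSyth paper.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists into one label list by plurality vote."""
    combined = []
    for votes in zip(*predictions):
        # Ties go to the label seen first, i.e. the earliest model in the pool.
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


def accuracy(predicted, labels):
    """Fraction of positions where the prediction matches the label."""
    return sum(p == y for p, y in zip(predicted, labels)) / len(labels)


def backward_elimination(pool, labels):
    """Greedily drop models while the ensemble's validation accuracy improves.

    pool: dict mapping a (hypothetical) model name to its predicted labels.
    Returns the sorted names of the retained models and their accuracy.
    """
    selected = dict(pool)
    best = accuracy(majority_vote(list(selected.values())), labels)
    improved = True
    while improved and len(selected) > 1:
        improved = False
        for name in list(selected):
            trial = {k: v for k, v in selected.items() if k != name}
            score = accuracy(majority_vote(list(trial.values())), labels)
            if score > best:  # accept the first removal that strictly helps
                best, selected, improved = score, trial, True
                break
    return sorted(selected), best
```

A greedy search like this does not examine every combination of models; an exhaustive search over subsets would, at exponential cost in the pool size.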

“…Jin et al [32] extended this method by restoring the pruned weights, training the network again, and repeating the process. Rather than pruning by thresholding, Aghasi et al [1,2] proposed Net-Trim, which prunes an already trained network layer by layer using convex optimization in order to ensure that the layer inputs and outputs remain consistent with the original network. For CNNs in particular, filter or channel pruning is preferred because it significantly reduces the amount of weight parameters required compared to individual weight pruning.…”

Section: Introduction (mentioning)

confidence: 99%
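The layer-by-layer convex pruning mentioned in the statement above can be illustrated with a simplified stand-in: refit one neuron's weights so that its response on the layer inputs stays close to the trained response, with an ℓ1 penalty that drives weights to zero. This sketch is a plain lasso solved by iterative soft-thresholding (ISTA). It ignores the ReLU constraints, is not the Net-Trim program or its solver, and the names `ista_sparse_refit` and `shrink` are hypothetical.

```python
import math


def shrink(x, t):
    """Soft-thresholding: move x toward zero by t, clipping at zero."""
    return math.copysign(max(abs(x) - t, 0.0), x)


def ista_sparse_refit(X, y, lam, iters=500):
    """Approximately solve min_w 0.5*||Xw - y||^2 + lam*||w||_1 with ISTA.

    X: list of input rows (one per sample) feeding the layer.
    y: the trained neuron's response on those samples.
    lam: sparsity weight; larger values zero out more weights.
    """
    n_features = len(X[0])
    # The squared Frobenius norm upper-bounds the gradient's Lipschitz
    # constant, so 1 / lipschitz is a safe (if conservative) step size.
    lipschitz = sum(v * v for row in X for v in row)
    step = 1.0 / lipschitz
    w = [0.0] * n_features
    for _ in range(iters):
        residual = [sum(a * b for a, b in zip(row, w)) - t
                    for row, t in zip(X, y)]
        grad = [sum(r * row[j] for r, row in zip(residual, X))
                for j in range(n_features)]
        w = [shrink(wj - step * gj, step * lam) for wj, gj in zip(w, grad)]
    return w
```

Because the problem is convex, the iteration converges to a global minimizer for any starting point, which is the property that distinguishes this family of methods from pruning by thresholding followed by retraining.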

Structured Sparsity of Convolutional Neural Networks via Nonconvex Sparse Group Regularization

Bui

Park

Zhang

et al.

2021

Front. Appl. Math. Stat.

Convolutional neural networks (CNN) have been hugely successful recently with superior accuracy and performance in various imaging applications, such as classification, object detection, and segmentation. However, a highly accurate CNN model requires millions of parameters to be trained and utilized. Even to increase its performance slightly would require significantly more parameters due to adding more layers and/or increasing the number of filters per layer. Apparently, many of these weight parameters turn out to be redundant and extraneous, so the original, dense model can be replaced by its compressed version attained by imposing inter- and intra-group sparsity onto the layer weights during training. In this paper, we propose a nonconvex family of sparse group lasso that blends nonconvex regularization (e.g., transformed ℓ1, ℓ1−ℓ2, and ℓ0) that induces sparsity onto the individual weights and ℓ2,1 regularization onto the output channels of a layer. We apply variable splitting onto the proposed regularization to develop an algorithm that consists of two steps per iteration: gradient descent and thresholding. Numerical experiments are demonstrated on various CNN architectures showcasing the effectiveness of the nonconvex family of sparse group lasso in network sparsification and test accuracy on par with the current state of the art.
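The "gradient descent and thresholding" pattern in this abstract can be sketched for the ℓ2,1 term alone. The example below assumes each row of the weight matrix holds one output channel's weights and applies group soft-thresholding, the proximal operator of the ℓ2,1 norm. It is a plain proximal-gradient step, not the paper's variable-splitting algorithm, and it omits the nonconvex penalties on individual weights. The function names are hypothetical.

```python
import math


def group_soft_threshold(rows, t):
    """Proximal operator of t * (sum of row-wise l2 norms).

    Each row is shrunk toward zero as a unit; a row whose norm is at most t
    becomes all zeros, which removes that output channel entirely.
    """
    out = []
    for row in rows:
        norm = math.sqrt(sum(w * w for w in row))
        scale = max(1.0 - t / norm, 0.0) if norm > 0 else 0.0
        out.append([scale * w for w in row])
    return out


def proximal_step(rows, grads, lr, lam):
    """One iteration: gradient descent on the loss, then group thresholding."""
    stepped = [[w - lr * g for w, g in zip(row, grad_row)]
               for row, grad_row in zip(rows, grads)]
    return group_soft_threshold(stepped, lr * lam)
```

With threshold 1.0, a row such as [3.0, 4.0] (norm 5) is scaled by 0.8, while a row such as [0.3, 0.4] (norm 0.5) is set to zero, which is how the penalty produces channel-level sparsity.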
