Constrained Optimization Based Low-Rank Approximation of Deep Neural Networks

Li, Chong; Shi, Chen

doi:10.1007/978-3-030-01249-6_45

Cited by 45 publications

(32 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Cosine Transform) 和由一维 DCT 变换和一维小波构造的小波系统。Denton 等人 [124] 提出了卷积核的低秩近似和聚类方法，实现单个卷积层 2 倍加速。Li 等人 [125] 提出用于寻找训练 CNN 的最优低秩近似，即基于约束优化的低秩近似，从而限制累加操作和内存占比。除此之外，仍然有很多基于低秩分解的加速方法 [29,30,126,127,128] ，虽然低秩分解对于模型加速和压缩简单有效，但其实现由于涉及分解操作，不仅带来额外的计算压力，同时还需要大量的模型重训练以实现收敛。并且不适用不包含卷积操作的神经网络，本文中不针对低秩分解方法进行特定分析。 2.5 神经网络架构搜索目前的压缩和加速方法主要针对特定的神经网络结构，但当我们考虑它们的组合时，可以探索的架构策略就会极具增加。神经网络架构搜索 [31,32] 就是为了对定义的神经网络不同组件的一组决策的搜索。是一种系统化、自动化的学习最佳神经网络架构的方法。根据 Elsken 等人 [129]…”

Section: 低秩分解利用矩阵或张量分解来估计神经网络中卷积层的权重参数。卷积核可以看作是一个三维张量，而低秩分解则是通过张量分解的思想来消除三维张量中存在的冗余。低秩分解来解决卷积加速问题已经有了很多研究，例如最早的高维 Dct (Discreteunclassified

Robustness analysis for compact neural networks

Chen¹,

Peng²

2022

Sci. Sin.-Tech.

View full text Add to dashboard Cite

Robustness analysis for compact neural networks

Chen¹,

Peng²

2022

Sci. Sin.-Tech.

View full text Add to dashboard Cite

“…In contrast, [4] achieves compression by combining channel wise low rank approximation and with separable one-dimensional spatial filters. In [7] multiply-accumulate operations are reduced via constrained optimization. Low rank factorization and pruning are combined in [8] by cascading the low rank projections of filters in the current layer to the next layer.…”

Section: Conventional Convolution Layer Basis Convolution Layermentioning

confidence: 99%

Compressing Deep CNNs Using Basis Representation and Spectral Fine-Tuning

Tayyab

Khan

Mahalanobis

2021

2021 IEEE International Conference on Image Processing (ICIP)

View full text Add to dashboard Cite

We propose an efficient and straightforward method for compressing deep convolutional neural networks (CNNs) that uses basis filters to represent the convolutional layers, and optimizes the performance of the compressed network directly in the basis space. Specifically, any spatial convolution layer of the CNN can be replaced by two successive convolution layers: the first is a set of three-dimensional orthonormal basis filters, followed by a layer of one-dimensional filters that represents the original spatial filters in the basis space. We jointly fine-tune both the basis and the filter representation to directly mitigate any performance loss due to the truncation. Generality of the proposed approach is demonstrated by applying it to several well known deep CNN architectures and data sets for image classification and object detection. We also present the execution time and power usage at different compression levels on the Xavier Jetson AGX processor.

show abstract

“…Two efficient low-rank decomposition schemes were proposed and used in the literature. The scheme 1 [3,5,[7][8][9] applies low-rank to an n×cd 2 sized reshape of the weight tensor W. The scheme 2 [1, Regular convolution Scheme 1…”

Section: Efficient Low-rank Decomposition Schemes Of Convolutional Layers Induced By Specific Matrix Reshapesmentioning

confidence: 99%

“…Out of many compression forms, recently, the low-rank compression re-emerged as an efficient way to achieve both compressed in size and reduced in FLOPs neural networks [1,2]. An ingredient that made low-rank more attractive for deep network compression is the research on an automatic way of setting the ranks of the decompositions [1][2][3], whereas previously, the ranks were fixed by hand [4,5] or heuristically estimated [6][7][8]. To further improve the performance of the low-rank compression, we address one overlooked ingredient -the choice of the weight reshaping when applying the decomposition.…”

Section: Introductionmentioning

confidence: 99%

Optimal Selection of Matrix Shape and Decomposition Scheme for Neural Network Compression

Idelbayev

Carreira-Perpiñán

2021

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

When applying the low-rank decomposition to neural networks, tensor-shaped weights need to be reshaped into a matrix first. While many matrix reshapes are possible, some of them induce a low-rank decomposition scheme that can be more efficiently implemented as a sequence of layers. This poses the following problem: how should one select both the matrix reshape and associated low-rank decomposition scheme in order to compress a neural network so that its implementation is as efficient as possible? We formulate this problem as a mixed-integer optimization over the weights, ranks, and decompositions schemes; and we provide an efficient alternating optimization algorithm involving two simple steps: a step over the weights of the neural network (solved by SGD), and a step over the ranks and decomposition schemes (solved by an SVD). Our algorithm automatically selects the most suitable ranks and decomposition schemes to efficiently reduce compression costs (e.g., FLOPs) of various networks.

show abstract

Constrained Optimization Based Low-Rank Approximation of Deep Neural Networks

Cited by 45 publications

References 14 publications

Robustness analysis for compact neural networks

Robustness analysis for compact neural networks

Compressing Deep CNNs Using Basis Representation and Spectral Fine-Tuning

Optimal Selection of Matrix Shape and Decomposition Scheme for Neural Network Compression

Contact Info

Product

Resources

About