Fast CP‐compression layer: Tensor CP‐decomposition to compress layers in deep learning

Ji, Yuwang; Wang, Qiang

doi:10.1049/ipr2.12507

Cited by 3 publications

(2 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To verify the superiority of the proposed method, HMC is compared with the single compression algorithm and the additive hybrid compression algorithm, where tensor decomposition algorithms include Tucker [ 43 ], CP [ 44 ], TT [ 45 ], and MUSCO [ 28 ]. Structured pruning algorithms include Hrank [ 16 ], CHEX [ 17 ], DepGraph [ 18 ]; The additive hybrid compression algorithm includes literature [ 41 ] and ATMC [ 38 ].…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

HMC: Hybrid model compression method based on layer sensitivity grouping

Yang,

Yu,

Yang

et al. 2023

PLoS ONE

View full text Add to dashboard Cite

Previous studies have shown that deep models are often over-parameterized, and this parameter redundancy makes deep compression possible. The redundancy of model weight is often manifested as low rank and sparsity. Ignoring any part of the two or the different distributions of these two characteristics in the model will lead to low accuracy and a low compression rate of deep compression. To make full use of the difference between low-rank and sparsity, a unified framework combining low-rank tensor decomposition and structured pruning is proposed: a hybrid model compression method based on sensitivity grouping (HMC). This framework unifies the existing additive hybrid compression method (AHC) and the non-additive hybrid compression method (NaHC) proposed by us into one model. The latter group the network according to the sensitivity difference of the convolutional layer to different compression methods, which can better integrate the low rank and sparsity of the model compared with the former. Experiments show that our approach achieves a better trade-off between test accuracy and compression ratio when compressing the ResNet family of models than other recent compression methods using a single strategy or additive hybrid compression.

show abstract

Section: Methodsmentioning

confidence: 99%