Proceedings of the 49th Annual International Symposium on Computer Architecture (ISCA 2022)
DOI: 10.1145/3470496.3527419

Cascading structured pruning

Abstract: Performance and efficiency of running modern Deep Neural Networks (DNNs) are heavily bounded by data movement. To mitigate the data movement bottlenecks, recent DNN inference accelerator designs widely adopt aggressive compression techniques and sparse-skipping mechanisms. These mechanisms avoid transferring or computing with zero-valued weights or activations to save time and energy. However, such sparse-skipping logic involves large input buffers and irregular data access patterns, thus precluding many energ…
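To make the contrast with unstructured sparse-skipping concrete, below is a minimal NumPy sketch of structured (neuron/channel) pruning across two fully connected layers. It is an illustrative example only, not the cascading scheme proposed in the paper; the function name, the L2-norm saliency criterion, and the keep ratio are all assumptions introduced for the sketch.

```python
import numpy as np

def prune_output_neurons(W1, b1, W2, keep_ratio=0.5):
    """Structurally prune whole output neurons of layer 1 (rows of W1).

    Because entire neurons are removed, the matching input columns of the
    next layer's weight matrix W2 can be dropped as well, so the remaining
    matrices stay dense and need no sparse-skipping buffers or logic.
    Sketch only: the saliency criterion and names are hypothetical.
    """
    # Rank layer-1 neurons by the L2 norm of their weight rows.
    saliency = np.linalg.norm(W1, axis=1)
    n_keep = max(1, int(keep_ratio * W1.shape[0]))
    keep = np.sort(np.argsort(saliency)[-n_keep:])

    # Drop pruned rows of W1/b1 and the corresponding columns of W2.
    return W1[keep], b1[keep], W2[:, keep]

# Toy usage: a 2-layer MLP with weight shapes (out1, in) and (out2, out1).
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(8, 16)), rng.normal(size=8)
W2 = rng.normal(size=(4, 8))
x = rng.normal(size=16)

W1p, b1p, W2p = prune_output_neurons(W1, b1, W2, keep_ratio=0.5)
y = W2p @ np.maximum(W1p @ x + b1p, 0.0)  # dense compute on smaller matrices
print(W1p.shape, W2p.shape, y.shape)       # (4, 16) (4, 4) (4,)
```

The point of the sketch is that structured pruning shrinks the matrices themselves, so the forward pass stays a regular dense computation, whereas element-wise (unstructured) sparsity leaves the shapes intact and requires the kind of sparse-skipping hardware the abstract describes.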

Cited by 15 publications (1 citation statement) | References 29 publications
“…Structured pruning reduces computational complexity, simplifies sparse matrix computations, and is easier to use across different deep learning frameworks. Consequently, recent research has been inclined towards employing structured pruning algorithms for model pruning [62][63][64][65][66][67].…”
Section: Analysis and Discussion
confidence: 99%