2021
DOI: 10.48550/arxiv.2107.07467
Preprint

Only Train Once: A One-Shot Neural Network Training And Pruning Framework

Abstract: Structured pruning is a commonly used technique in deploying deep neural networks (DNNs) onto resource-constrained devices. However, the existing pruning methods are usually heuristic, task-specified, and require an extra fine-tuning procedure. To overcome these limitations, we propose a framework that compresses DNNs into slimmer architectures with competitive performances and significant FLOPs reductions by Only-Train-Once (OTO). OTO contains two keys: (i) we partition the parameters of DNNs into zero-invari…
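As a rough illustration of the zero-invariant-group idea the abstract alludes to (the shapes and parameter names below are hypothetical, not OTO's actual implementation): a convolution filter, its bias, and the matching BatchNorm scale/shift affect only one output channel, so they can be zeroed jointly without changing any other channel's output and form one prunable group.

```python
import numpy as np

# Illustrative sketch of zero-invariant groups (ZIGs), not OTO's real code:
# each output channel's filter, bias, and BatchNorm gamma/beta form one group.
rng = np.random.default_rng(0)
out_ch = 8
conv_w = rng.normal(size=(out_ch, 3, 3, 3))   # one filter per output channel
conv_b = rng.normal(size=out_ch)
bn_gamma = rng.normal(size=out_ch)
bn_beta = rng.normal(size=out_ch)

# Group saliency: the l2 norm over all parameters belonging to the group.
group_norms = np.sqrt(
    (conv_w ** 2).reshape(out_ch, -1).sum(axis=1)
    + conv_b ** 2 + bn_gamma ** 2 + bn_beta ** 2
)

# Zero the weakest half of the groups; surviving channels are unaffected,
# which is exactly what makes the groups "zero-invariant".
drop = np.argsort(group_norms)[: out_ch // 2]
for arr in (conv_w, conv_b, bn_gamma, bn_beta):
    arr[drop] = 0
```

Because every parameter tied to a dropped channel is zero, the zeroed channels can later be removed entirely, yielding a dense slimmer network.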

Cited by 1 publication (10 citation statements)
References 65 publications
“…Pruning is classified as either unstructured/weight pruning [14] or structured pruning [15]-[22], according to how the importance of parameters is determined and how they are removed. Unstructured pruning [14] removes individual parameters, judging each parameter's importance by the saliency score of the given algorithm.…”
Section: B. Pruning
confidence: 99%
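The per-parameter saliency idea described above can be sketched with the simplest common choice, magnitude pruning, where the saliency of a weight is just its absolute value (the matrix and sparsity level here are illustrative, not from any cited method):

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=(8, 8))  # a hypothetical weight matrix

def unstructured_prune(w: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the fraction `sparsity` of weights with the smallest |w|,
    using magnitude as the per-parameter saliency score."""
    k = int(w.size * sparsity)
    if k == 0:
        return w.copy()
    threshold = np.sort(np.abs(w), axis=None)[k - 1]
    return w * (np.abs(w) > threshold)

pruned = unstructured_prune(w, 0.5)
```

The result is a sparse matrix of the same shape, which is why, as the next statement notes, unstructured pruning needs sparse-matrix library or hardware support to realize any speedup.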
“…Unstructured pruning also has the disadvantage of requiring specific library and hardware support for sparse matrices. In contrast, structured pruning [15]-[22] judges the importance of network connections by the saliency score of the given algorithm at the level of larger units such as channels. Structured pruning does not produce a sparse matrix, so existing libraries can be used and memory usage is reduced without requiring additional hardware.…”
Section: B. Pruning
confidence: 99%