2022
DOI: 10.3390/s22114081
|View full text |Cite
|
Sign up to set email alerts
|

MobilePrune: Neural Network Compression via ℓ0 Sparse Group Lasso on the Mobile System

Abstract: It is hard to directly deploy deep learning models on today’s smartphones due to the substantial computational costs introduced by millions of parameters. To compress the model, we develop an ℓ0-based sparse group lasso model called MobilePrune which can generate extremely compact neural network models for both desktop and mobile platforms. We adopt group lasso penalty to enforce sparsity at the group level to benefit General Matrix Multiply (GEMM) and develop the very first algorithm that can optimize the ℓ0 … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 27 publications
0
3
0
Order By: Relevance
“…If the terrain condition is not considered, the detected road might be risky for UGV. Additionally, the inference speed of our method needs further improvement, and we will accelerate and prune the model to improve the inference speed [36].…”
Section: Discussionmentioning
confidence: 99%
“…If the terrain condition is not considered, the detected road might be risky for UGV. Additionally, the inference speed of our method needs further improvement, and we will accelerate and prune the model to improve the inference speed [36].…”
Section: Discussionmentioning
confidence: 99%
“…However, model deployment is sometimes costly due to the large number of parameters in the DNN. To address this problem, many methods [ 1 , 2 , 3 , 4 , 5 , 6 ] have been proposed to compress networks and reduce computational quantities. These methods are mainly divided into two categories: structured pruning and unstructured pruning, in which the main method of structured pruning is the filter pruning, while the unstructured pruning method is mainly achieved by weight pruning.…”
Section: Introductionmentioning
confidence: 99%
“…Convolutional Neural Networks (CNNs), a type of deep learning model, have demonstrated remarkable success in various computer vision tasks, including image classification, object detection, and segmentation [3]. Nevertheless, these models are often large and demand substantial computational resources for training and inference, which may hinder their deployment on resource-constrained devices like mobile phones and embedded systems [4].…”
mentioning
confidence: 99%