2022
DOI: 10.3390/electronics11071066
A Novel Deep Learning Model Compression Algorithm

Abstract: To address the heavy computing-power consumption of large models, this paper proposes a novel model compression algorithm. Firstly, this paper proposes an interpretable weight allocation method for the losses between a student network (a network model with poorer performance), a teacher network (a network model with better performance), and the real labels. Then, different from previous simple pruning and fine-tuning, this paper performs knowledge distillation on the pruned model and quantizes the residual…
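The abstract weights the losses among the student output, the teacher output, and the real labels. The following is a minimal sketch of a conventional weighted distillation loss under that setup; the fixed weight `alpha` and temperature `T` are illustrative assumptions and do not reproduce the paper's interpretable weight allocation method.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, alpha=0.5, T=4.0):
    """Weighted sum of a teacher-student term and a ground-truth term (sketch)."""
    # Soft-target term: the student matches the teacher's softened distribution.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Hard-target term: the student matches the real labels.
    hard = F.cross_entropy(student_logits, labels)
    # alpha allocates weight between the two terms; the paper derives this
    # allocation in an interpretable way, which is not modeled here.
    return alpha * soft + (1.0 - alpha) * hard
```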

Cited by 8 publications (7 citation statements)
References: 34 publications
“…Given the audio signal y(t), the STFT representation at time t and frequency f, X(t, f), can be obtained by Equation (1), where w(t) is the window function and τ is the integration variable. The instantaneous phase can be extracted from the STFT as in Equation (2). Then, the stretched phase is calculated as in Equation (3), where a is the time stretch factor.…”
Section: Function Main() (mentioning)
confidence: 99%
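The excerpt refers to Equations (1)-(3) of the citing paper, which are not reproduced on this page. As a rough sketch of the quantities it names, the snippet below computes a windowed STFT, its instantaneous phase, and a phase sequence scaled by a time stretch factor `a` in a generic phase-vocoder style; librosa is assumed for the STFT, and the exact stretched-phase rule of Equation (3) may differ.

```python
import numpy as np
import librosa  # assumed here for the windowed STFT

def stretched_phase(y, a=1.5, n_fft=2048, hop_length=512):
    # STFT X(t, f) of the audio signal y(t) with a Hann window w(t), cf. Equation (1).
    X = librosa.stft(y, n_fft=n_fft, hop_length=hop_length, window="hann")
    # Instantaneous phase extracted from the STFT, cf. Equation (2).
    phase = np.angle(X)
    # Illustrative stretched phase: frame-to-frame phase increments are
    # accumulated after scaling by the time stretch factor a.
    dphase = np.diff(phase, axis=1, prepend=phase[:, :1])
    return np.cumsum(a * dphase, axis=1)
```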
“…Although DL models are complex and resource-intensive, these models can be leveraged to fit into resource-constrained environments by techniques such as model pruning, quantization, and knowledge distillation [2,3]. These techniques, aimed at reducing both the size and computational complexity of DL models, come with the challenge of achieving an optimal trade-off between complexity reduction and sustained performance.…”
Section: Introduction (mentioning)
confidence: 99%
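Of the three techniques named in this excerpt, pruning is perhaps the quickest to illustrate. Below is a minimal sketch of unstructured magnitude pruning using PyTorch's `torch.nn.utils.prune` utilities; the toy model and the 30% sparsity level are arbitrary choices for illustration, not taken from the cited works.

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

# Toy model standing in for a resource-hungry DL model.
model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))

for module in model.modules():
    if isinstance(module, nn.Linear):
        # Zero out the 30% smallest-magnitude weights in each linear layer.
        prune.l1_unstructured(module, name="weight", amount=0.3)
        # Make the pruning permanent by removing the re-parametrization.
        prune.remove(module, "weight")

zeroed = sum(int((m.weight == 0).sum()) for m in model.modules()
             if isinstance(m, nn.Linear))
print(f"weights zeroed by pruning: {zeroed}")
```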
“…With limited resources and funding, the optimization of CNNs (Habib & Qureshi, 2022) is crucial to their scalability in OoCs. To address the issue of scalability, customized model design, compression of model designs (Zhao et al., 2022), and iterative improvements to models over time as deep learning progresses are all actions that could be taken to increase computational efficiency.…”
Section: Scalability (mentioning)
confidence: 99%
“…However, the enormous number of computations and parameters of CNNs hinders further development. Thus, it is not practical to deploy heavy CNNs on resource-constrained computing devices, such as embedded systems and mobile devices [14-16]. To address these problems, substantial research efforts have been devoted to compression techniques: channel pruning [17-20], low-rank decomposition [21-23], and weight quantization [24,25].…”
Section: Introduction (mentioning)
confidence: 99%
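As a concrete illustration of the low-rank decomposition family cited in this excerpt, the sketch below approximates a dense weight matrix by a truncated SVD; the matrix shape and the rank `r` are arbitrary, and the cited works [21-23] use more elaborate schemes than this plain factorization.

```python
import numpy as np

def low_rank_factorize(W, r):
    """Approximate W (m x n) by A (m x r) @ B (r x n) via truncated SVD."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :r] * S[:r]  # absorb the singular values into the left factor
    B = Vt[:r, :]
    return A, B

W = np.random.randn(256, 512)
A, B = low_rank_factorize(W, r=32)
# Storage drops from 256*512 parameters to 256*32 + 32*512.
err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
print(f"relative approximation error: {err:.3f}")
```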