2022
DOI: 10.1109/tcad.2022.3197522
Energy-Efficient DNN Inference on Approximate Accelerators Through Formal Property Exploration

Abstract: Deep Neural Networks (DNNs) are being heavily utilized in modern applications and are putting energy-constrained devices to the test. To bypass high energy consumption issues, approximate computing has been employed in DNN accelerators to balance out the accuracy-energy reduction trade-off. However, the approximation-induced accuracy loss can be very high and drastically degrade the performance of the DNN. Therefore, there is a need for a fine-grained mechanism that would assign specific DNN operations to approxi…

Cited by 4 publications (2 citation statements)
References 39 publications
“…In the case of channel pruning, there are various methods for retraining the pruned model to recover the lost accuracy, such as fine-tuning the model on the pruned architecture or using knowledge distillation to transfer the knowledge from the original model to the pruned model [ 27 ]. However, these methods can be computationally expensive and time-consuming, and they may not be feasible or practical in all scenarios [ 18 ]. On the contrary, the proposed method does not involve retraining or fine-tuning; therefore, to keep evaluations fair, we avoid directly comparing our results with other pruning methods that do involve such techniques.…”
Section: Discussion
confidence: 99%
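The channel-pruning approach discussed in the citation above ranks channels and removes the least important ones without retraining. A minimal sketch of one common criterion, L1-norm channel selection, is shown below; `prune_channels_by_l1` is a hypothetical helper for illustration and is not taken from the cited paper.

```python
import numpy as np

def prune_channels_by_l1(weights, keep_ratio=0.5):
    """Rank output channels of a conv weight tensor shaped
    (out_channels, in_channels, kh, kw) by L1 norm and keep
    the top fraction. Hypothetical sketch, no retraining step."""
    # L1 norm of each output channel's filter.
    l1 = np.abs(weights).sum(axis=(1, 2, 3))
    n_keep = max(1, int(round(keep_ratio * weights.shape[0])))
    # Indices of the highest-norm channels, in ascending order.
    keep = np.sort(np.argsort(l1)[::-1][:n_keep])
    return weights[keep], keep

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 3, 3, 3))   # toy conv layer: 8 output channels
pruned, kept = prune_channels_by_l1(w, keep_ratio=0.5)
print(pruned.shape)  # (4, 3, 3, 3)
```

In practice the removed channels' downstream connections must also be dropped; methods that then fine-tune or distill, as the citation notes, recover accuracy at extra compute cost.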
“…We combine the approaches presented in Section 2.1 and Section 2.2 to achieve lightweight DNN inference without compromises in terms of accuracy at run-time. We consider run-time scenarios where real-time performance is prioritized [ 18 ] and therefore target DNN inference where each image is processed independently rather than in big batches.…”
Section: Methods
confidence: 99%
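The per-image (batch size 1) processing described in the citation above, where latency matters more than throughput, can be sketched as follows; `infer` is a hypothetical stand-in for a DNN forward pass, not the cited system's implementation.

```python
import time

def infer(x):
    # Stand-in for a DNN forward pass on one image.
    return sum(v * v for v in x)

# 32 toy "images", each a flat feature vector.
images = [[float(i)] * 64 for i in range(32)]

# Batch size 1: each image is processed independently as it arrives,
# so per-image latency is bounded by a single forward pass.
start = time.perf_counter()
outputs = [infer(img) for img in images]
per_image_time = (time.perf_counter() - start) / len(images)
print(len(outputs))  # 32
```

Batching the same 32 images would raise throughput but delay the first result until the whole batch is assembled, which is why real-time scenarios favor the loop above.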