PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization

Lahiany, Assaf; Aperstein, Yehudit

doi:10.1109/access.2022.3187002

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Book1

Article1

Relationship

Self Cite0

Independent2

Authors

Journals

Cited by 2 publications

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Towards a Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

Shen,

Tziritas,

Theodoropoulos

2024

IFIP Advances in Information and Communication Technology

View full text Add to dashboard Cite

Towards a Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

Shen,

Tziritas,

Theodoropoulos

2024

IFIP Advances in Information and Communication Technology

View full text Add to dashboard Cite

Early-Exit Deep Neural Network - A Comprehensive Survey

Rahmath P,

Srivastava,

Chaurasia

et al. 2024

ACM Comput. Surv.

View full text Add to dashboard Cite

Deep neural networks (DNNs) typically have a single exit point that makes predictions by running the entire stack of neural layers. Since not all inputs require the same amount of computation to reach a confident prediction, recent research has focused on incorporating multiple ”exits” into the conventional DNN architecture. Early-exit DNNs are multi-exit neural networks that attach many side branches to the conventional DNN, enabling inference to stop early at intermediate points. This approach offers several advantages, including speeding up the inference process, mitigating the vanishing gradients problems, reducing overfitting and overthinking tendencies. It also supports DNN partitioning across devices and is ideal for multi-tier computation platforms such as edge computing. This paper decomposes the early-exit DNN architecture and reviews the recent advances in the field. The study explores its benefits, designs, training strategies, and adaptive inference mechanisms. Various design challenges, application scenarios, and future directions are also extensively discussed.

show abstract

PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization

Cited by 2 publications

References 27 publications

Towards a Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

Towards a Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

Early-Exit Deep Neural Network - A Comprehensive Survey

Contact Info

Product

Resources

About