2022
DOI: 10.1109/access.2022.3187002
|View full text |Cite
|
Sign up to set email alerts
|

PTEENet: Post-Trained Early-Exit Neural Networks Augmentation for Inference Cost Optimization

Abstract: For many practical applications, a high computational cost of inference over deep network architectures might be unacceptable. A small degradation in the overall inference accuracy might be a reasonable price to pay for a significant reduction in the required computational resources. In this work, we describe a method for introducing ''shortcuts'' into the DNN feedforward inference process by skipping costly feedforward computations whenever possible. The proposed method is based on the previously described Br… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 27 publications
0
0
0
Order By: Relevance