Leveraging Mixed-Precision CNN Inference for Increased Robustness and Energy Efficiency

Hotfilter, Tim; Hoefer, Julian; Merz, Philipp; Kreß, Fabian; Kempf, Fabian; Harbaum, Tanja; Becker, Jürgen

doi:10.1109/socc58585.2023.10256738

2023 IEEE 36th International System-on-Chip Conference (SOCC) 2023

DOI: 10.1109/socc58585.2023.10256738

|View full text |Cite

Leveraging Mixed-Precision CNN Inference for Increased Robustness and Energy Efficiency

Tim Hotfilter,

Julian Hoefer,

Philipp Merz

et al.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

High-Level Design of Precision-Scalable DNN Accelerators Based on Sum-Together Multipliers

Urbinati,

Casu

2024

IEEE Access

View full text Add to dashboard Cite

Precison-scalable (PS) multipliers are gaining traction in Deep Neural Network accelerators, particularly for enabling mixed-precision (MP) quantization in Deep Learning at the edge. This paper focuses on the Sum-Together (ST) class of PS multipliers, which are subword-parallel multipliers that can execute a standard multiplication at full precision or a dot-product with parallel low-precision operands. Our contributions in this area encompass multiple aspects: we enrich our previous comparison of SoA ST multipliers by including our recent radix-4 Booth ST multiplier and two novel designs; we extend the explanation of the architecture and the design flow of our previously proposed ST-based PS hardware accelerators designed for 2D-Convolution, Depth-wise Convolution, and Fully-Connected layers that we developed using High-Level Synthesis (HLS); we implement the uniform integer quantization equations in hardware; we conduct a broad HLS-driven design space exploration of our ST-based accelerators, varying numerous hardware parameters; finally, we showcase the advantages of ST-based accelerators when integrated into System-on-Chips (SoCs) in three different scenarios (low-area, low-power, and low-latency), running inference on MP-quantized MLPerf Tiny models as case study. Across the three scenarios, the results show an average latency speedup of 1.46x, 1.33x, and 1.29x, a reduced energy consumption in most of the cases, and a marginal area overhead of 0.9%, 2.5% and 8.0%, compared to SoCs with accelerators based on fixed-precision 16-bit multipliers. To sum up, our work provides a comprehensive understanding of ST-based accelerators' performance in an SoC context, paving the way for future enhancements and the solution of identified inefficiencies.

show abstract

High-Level Design of Precision-Scalable DNN Accelerators Based on Sum-Together Multipliers

Urbinati,

Casu

2024

IEEE Access

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Leveraging Mixed-Precision CNN Inference for Increased Robustness and Energy Efficiency

Cited by 1 publication

References 15 publications

High-Level Design of Precision-Scalable DNN Accelerators Based on Sum-Together Multipliers

High-Level Design of Precision-Scalable DNN Accelerators Based on Sum-Together Multipliers

Contact Info

Product

Resources

About