Controlled-potential oxidation of aliphatic amides

The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance and power gains can be obtained when DNN accelerators support low-precision numerical formats. Despite considerable research, there is still a knowledge gap on how low-precision operations can be realized for both DNN training and inference. In this work, we propose a DNN architecture, Deep Positron, with posit numerical format operating successfully at ≤8 bits for inference. We propose a precision-adaptable FPGA soft core for exact multiply-and-accumulate for uniform comparison across three numerical formats, fixed, floating-point and posit. Preliminary results demonstrate that 8-bit posit has better accuracy than 8-bit fixed or floating-point for three different low-dimensional datasets. Moreover, the accuracy is comparable to 32-bit floatingpoint on a Xilinx Virtex-7 FPGA device. The trade-offs between DNN performance and hardware resources, i.e. latency, power, and resource utilization, show that posit outperforms in accuracy and latency at 8-bit and below.

show abstract

Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

Carmichael

Langroudi

Khazanov

et al. 2019

View full text Add to dashboard Cite

Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for executing any reasonably complex task. To optimize the inference time and alleviate the power consumption of these networks, DNN accelerators with low-precision representations of data and DNN parameters are being actively studied. An interesting research question is in how low-precision networks can be ported to edge-devices with similar performance as high-precision networks. In this work, we employ the fixed-point, floating point, and posit numerical formats at ≤8-bit precision within a DNN accelerator, Deep Positron, with exact multiply-and-accumulate (EMAC) units for inference. A unified analysis quantifies the trade-offs between overall network efficiency and performance across five classification tasks.Our results indicate that posits are a natural fit for DNN inference, outperforming at ≤8-bit precision, and can be realized with competitive resource requirements relative to those of floating point.

show abstract

PositNN Framework: Tapered Precision Deep Learning Inference for the Edge

Langroudi

Carmichael

Gustafson

et al. 2019

View full text Add to dashboard Cite

Adaptive Posit: Parameter aware numerical format for deep learning inference on the edge

Langroudi

Karia

Gustafson

et al. 2020

View full text Add to dashboard Cite

TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Langroudi¹,

Karia²,

Pandit³

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hamed F. Langroudi

Deep Positron: A Deep Neural Network Using the Posit Number System

Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

PositNN Framework: Tapered Precision Deep Learning Inference for the Edge

Adaptive Posit: Parameter aware numerical format for deep learning inference on the edge

TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT

Contact Info

Product

Resources

About