2022
DOI: 10.1145/3527156

Hardware Approximate Techniques for Deep Neural Network Accelerators: A Survey

Abstract: Deep Neural Networks (DNNs) are very popular because of their high performance in various cognitive tasks in Machine Learning (ML). Recent advancements in DNNs have brought beyond-human accuracy in many tasks, but at the cost of high computational complexity. To enable efficient execution of DNN inference, more and more research works therefore exploit the inherent error resilience of DNNs and employ Approximate Computing (AC) principles to address the elevated energy demands of DNN accelerators. This articl…

Cited by 60 publications (14 citation statements)
References 111 publications
“…Another aggressive resource reduction technique impacting accuracy is the use of approximate computing units to perform the required computation [282].…”
Section: Arithmetic Units
Mentioning confidence: 99%
“…Approximate computing has been heavily utilized on DNN inference [8]. Many works present mapping methodologies that balance out the computation accuracy-power consumption trade-off, and recent research has focused on the design and utilization of approximate multipliers on DNN inference.…”
Section: Related Work
Mentioning confidence: 99%
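The accuracy–power trade-off mentioned in the statement above can be illustrated with a minimal sketch. This is a hypothetical truncation-based approximate multiplier (all names are illustrative, not taken from the survey): zeroing operand LSBs removes partial-product rows in hardware, and sweeping the truncation width maps out the error side of the trade-off.

```python
def truncated_mul(a: int, b: int, t: int) -> int:
    # Zero the t least-significant bits of each operand before an exact
    # multiply; in hardware this prunes partial products, saving energy.
    mask = ~((1 << t) - 1)
    return (a & mask) * (b & mask)

def mean_relative_error(t: int, width: int = 8) -> float:
    # Average relative error over all nonzero unsigned operand pairs of
    # the given bit width: the "accuracy" axis of the trade-off.
    total, count = 0.0, 0
    for a in range(1, 1 << width):
        for b in range(1, 1 << width):
            exact = a * b
            total += abs(exact - truncated_mul(a, b, t)) / exact
            count += 1
    return total / count

# Each additional truncated bit buys hardware savings at the cost of a
# strictly larger mean error — the design space a mapping methodology
# would search over.
errors = {t: mean_relative_error(t) for t in (1, 2, 3, 4)}
```

The sweep makes the monotone trade-off explicit: a mapper can pick the largest `t` whose error still meets the DNN's accuracy budget.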
“…A great amount of DNN operations can tolerate some degree of approximation [6], [7], and since the majority of DNN inference is spent on convolution and matrix multiplication operations, the design of approximate MAC units has attracted significant interest. Particularly, the majority of research has been focused on the design of approximate multipliers [8], as they are the most complex components of the MAC units and dominate energy consumption inside the unit. However, such multipliers are not application specific.…”
Section: Introduction
Mentioning confidence: 99%
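The MAC-level tolerance described above can be sketched as follows. This is a design sketch under assumed values, with a hypothetical truncation-based multiplier standing in for the approximate multiplier inside a MAC unit; it shows why a dot product (the core of convolution and matrix multiplication) absorbs per-product errors gracefully.

```python
def approx_mul(a: int, b: int, t: int = 2) -> int:
    # Hypothetical approximate multiplier: truncate the t LSBs of each
    # operand, then multiply exactly.
    mask = ~((1 << t) - 1)
    return (a & mask) * (b & mask)

def mac_dot(weights, activations, mul):
    # Accumulate products the way a MAC array does for one output neuron.
    acc = 0
    for w, x in zip(weights, activations):
        acc += mul(w, x)
    return acc

# Illustrative 8-bit weights/activations (assumed values, not from the paper).
w = [120, 87, 203, 55]
x = [91, 222, 44, 170]
exact = mac_dot(w, x, lambda a, b: a * b)
approx = mac_dot(w, x, lambda a, b: approx_mul(a, b, 2))
# The per-product errors partially wash out in the accumulation, so the
# dot-product-level relative error stays small.
```

Errors of individual products are bounded and uncorrelated here, which is one intuition behind DNN resilience to approximate multipliers.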
“…Consequently, their deployment as DNN accelerators in smart nanoscale applications becomes very challenging [7] e.g., complex DNN analytics in resource-constrained edge devices, where safety and energy are critical considerations, can lead to an unexpected energy outage which can jeopardize human lives. This problem can be solved by leveraging approximate computing [8], [9] -an inexact computing method that exploits the inherent error resilience of onboard applications for energy efficiency in DNN accelerators. However, approximate hardware acceleration is deemed to be inherently less reliable [10].…”
Mentioning confidence: 99%
“…Approximate computing-based deep neural network (AxDNN) accelerators are designed by incorporating inexact arithmetic units [11] [12], computation skipping [13], memory skipping [14], etc. Their fabrication at the nanoscale follows a sophisticated manufacturing process whose imperfections may result in manufacturing defects, such as process variations and permanent faults (stuck-at faults) [9]. As discussed in this paper, the permanent faults affect the compute units of AxDNN accelerators in every execution cycle and their presence as unmasked faults leads to serious failures in the whole system.…”
Mentioning confidence: 99%
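The permanent-fault behavior described above can be modelled with a short sketch. This is a minimal, hypothetical fault-injection model (names and values are illustrative): a stuck-at fault on a multiplier output bit corrupts the product in every execution cycle, so the error accumulates systematically rather than averaging out.

```python
def inject_stuck_at(word: int, bit: int, stuck_to: int) -> int:
    # Force one output bit to a constant, modelling a permanent
    # (stuck-at) manufacturing fault in a compute unit.
    if stuck_to:
        return word | (1 << bit)
    return word & ~(1 << bit)

def faulty_mac(acc: int, a: int, b: int, bit: int, stuck_to: int) -> int:
    # The fault sits on the multiplier output, so it corrupts the
    # product before accumulation in *every* cycle.
    return acc + inject_stuck_at(a * b, bit, stuck_to)

pairs = [(3, 5), (2, 7), (4, 4)]
clean = sum(a * b for a, b in pairs)
faulty = 0
for a, b in pairs:
    # Stuck-at-1 on bit 6: a large, systematic bias added each cycle.
    faulty = faulty_mac(faulty, a, b, bit=6, stuck_to=1)
```

Because the same bit is forced every cycle, the bias grows with the number of MAC operations, which is why unmasked permanent faults can cause serious failures in AxDNN accelerators.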