Systolic array architectures have recently emerged as successful accelerators for deep convolutional neural network (CNN) inference. Such architectures can efficiently execute general matrix-matrix multiplications (GeMM), but computing convolutions with this primitive requires transforming the 3D input tensor into an equivalent matrix, which can inflate the input data and increase off-chip memory traffic, a critical factor for energy efficiency. In this work, we propose a GeMM-based systolic array accelerator that uses a novel data feeder architecture to perform on-chip, on-the-fly convolution lowering (also known as im2col), supporting arbitrary tensor and kernel sizes as well as strided and dilated (or atrous) convolutions. By using our data feeder, we reduce memory transactions and required bandwidth on state-of-the-art CNNs by a factor of two, while adding area and power overheads of only 4% and 7%, respectively. An ASIC implementation of our accelerator in 22 nm technology fits in less than 1.1 mm² and reaches an energy efficiency of 1.10 TFLOP/s/W with 16-bit floating-point arithmetic.
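To make the lowering step concrete, the following is a minimal NumPy sketch of im2col with stride and dilation support, in the spirit of what the data feeder computes on the fly. The function name, argument names, and data layout are illustrative assumptions, not the accelerator's actual interface.

    import numpy as np

    def im2col(x, kh, kw, stride=1, dilation=1):
        """Lower a 3D input tensor (C, H, W) into a 2D matrix so that a
        convolution becomes a single GeMM. Each column holds one receptive
        field; dilation spaces the sampled pixels, stride spaces the fields.
        Illustrative sketch only -- the accelerator performs this on-chip."""
        c, h, w = x.shape
        # Effective kernel extent after dilation.
        eff_kh = (kh - 1) * dilation + 1
        eff_kw = (kw - 1) * dilation + 1
        out_h = (h - eff_kh) // stride + 1
        out_w = (w - eff_kw) // stride + 1
        cols = np.empty((c * kh * kw, out_h * out_w), dtype=x.dtype)
        for i in range(out_h):
            for j in range(out_w):
                patch = x[:,
                          i * stride : i * stride + eff_kh : dilation,
                          j * stride : j * stride + eff_kw : dilation]
                cols[:, i * out_w + j] = patch.ravel()
        return cols  # shape (C*kh*kw, out_h*out_w)

    # Convolution as GeMM: weights of shape (K, C*kh*kw) multiplied by
    # im2col(x) yield the (K, out_h*out_w) output feature map.

Because each column replicates a full receptive field, the lowered matrix is roughly kh*kw times larger than the input tensor; performing im2col on-chip avoids sending this inflated data over the off-chip memory interface.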
As the end of Moore's Law approaches, electronic system designers must find ways to keep up with the ever-increasing computational demands of the modern era. Some computationally intensive applications, such as multimedia processing, computer vision, and artificial intelligence, share a feature that makes them especially suitable for hardware-level optimizations: their inherent robustness to noise and errors. This allows circuit designers to relax the constraint that arithmetic operations, such as multiplications and additions, must be completely accurate. Instead, approximations can be used in the arithmetic units, enabling system-level reductions in hardware area and power consumption, as well as improvements in performance, while hardly affecting the output of the final application. In this work, we explore two approximate arithmetic techniques. First, we consider approximations at the circuit design level by implementing several approximate multiplier units and evaluating their accuracy when executing YOLOv3, a state-of-the-art deep neural network for camera-based object detection. Second, we apply the technique of overscaling to induce approximations in adder circuits by aggressively undervolting and overclocking them, and we compare the behavior of exact and approximate adders under these conditions. We find that, on one hand, some approximate multipliers can execute the YOLO network with almost no effect on the results, and on the other, approximate adder circuits are much more resilient to overscaling than exact adders.
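As a rough illustration of the first technique, the sketch below models a generic truncation-based approximate multiplier in Python. It is a behavioral stand-in under our own assumptions (operand width, number of dropped bits), not one of the specific multiplier units evaluated in this work.

    def approx_mul_truncated(a, b, drop_bits=4, width=16):
        """Generic truncation-based approximate multiplier: the low-order
        `drop_bits` of each unsigned operand are zeroed before an exact
        multiply, modeling a hardware design that omits the corresponding
        partial-product rows to save area and power. Illustrative only."""
        mask = ((1 << width) - 1) & ~((1 << drop_bits) - 1)
        return (a & mask) * (b & mask)

    # Example: approx_mul_truncated(1000, 2000) computes 992 * 2000
    # = 1,984,000 versus the exact 2,000,000, a relative error of ~0.8%.
    # The error stays small for large operands, which is one intuition
    # for why error-tolerant workloads like CNN inference absorb it well.

The error behavior shown here is the general intuition behind such designs; the accuracy figures reported in this work come from the specific multiplier circuits we implemented and evaluated on YOLOv3.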