Di Zhu scite author profile

Di Zhu

4Publications

7Citation Statements Received

31Citation Statements Given

How they've been cited

How they cite others

Affiliations

Nanjing University of Finance and Economics, Nanjing University, Beijing Information Science & Technology University

Publications

Order By: Most citations

Accelerating Deep Neural Networks by Combining Block-Circulant Matrices and Low-Precision Weights

Qin

Zhu

et al. 2019

Electronics

View full text Add to dashboard Cite

As a key ingredient of deep neural networks (DNNs), fully-connected (FC) layers are widely used in various artificial intelligence applications. However, there are many parameters in FC layers, so the efficient process of FC layers is restricted by memory bandwidth. In this paper, we propose a compression approach combining block-circulant matrix-based weight representation and power-of-two quantization. Applying block-circulant matrices in FC layers can reduce the storage complexity from O ( k 2 ) to O ( k ) . By quantizing the weights into integer powers of two, the multiplications in the reference can be replaced by shift and add operations. The memory usages of models for MNIST, CIFAR-10 and ImageNet can be compressed by 171 × , 2731 × and 128 × with minimal accuracy loss, respectively. A configurable parallel hardware architecture is then proposed for processing the compressed FC layers efficiently. Without multipliers, a block matrix-vector multiplication module (B-MV) is used as the computing kernel. The architecture is flexible to support FC layers of various compression ratios with small footprint. Simultaneously, the memory access can be significantly reduced by using the configurable architecture. Measurement results show that the accelerator has a processing power of 409.6 GOPS, and achieves 5.3 TOPS/W energy efficiency at 800 MHz.

show abstract

Team Water: The Champion of the RoboCup Middle Size League Competition 2013

Chen

Zhu

Tian

et al. 2014

View full text Add to dashboard Cite

Binary software vulnerability detection method based on attention mechanism

Han¹,

Pang²,

Zhou³

et al. 2020

View full text Add to dashboard Cite

An On-Chip 2-D DFT Accelerator Ultrasonic Wavefront for Convolutional Neural Networks

Teng

Raju

Zhu

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Di Zhu

Accelerating Deep Neural Networks by Combining Block-Circulant Matrices and Low-Precision Weights

Team Water: The Champion of the RoboCup Middle Size League Competition 2013

Binary software vulnerability detection method based on attention mechanism

An On-Chip 2-D DFT Accelerator Ultrasonic Wavefront for Convolutional Neural Networks

Contact Info

Product

Resources

About