Ze-Feng Gao scite author profile

Ze-Feng Gao

5Publications

36Citation Statements Received

84Citation Statements Given

How they've been cited

How they cite others

169

Affiliations

Renmin University of China

Publications

Order By: Most citations

Compressing deep neural networks by matrix product operators

Gao

Cheng

et al. 2020

Phys. Rev. Research

View full text Add to dashboard Cite

Long Short-Term Memory (LSTM) models are the building blocks of many state-of-the-art algorithms for Natural Language Processing (NLP). But, there are a large number of parameters in an LSTM model. This usually brings out a large amount of memory space needed for operating an LSTM model. Thus, an LSTM model usually requires a large amount of computational resources for training and predicting new data, suffering from computational inefficiencies. Here we propose an alternative LSTM model to reduce the number of parameters significantly by representing the weight parameters based on matrix product operators (MPO), which are used to characterize the local correlation in quantum states in physics. We further experimentally compare the compressed models based the MPO-LSTM model and the pruning method on sequence classification and sequence prediction tasks. The experimental results show that our proposed MPO-based method outperforms the pruning method.

show abstract

A Model Compression Method With Matrix Product Operators for Speech Enhancement

Sun

Gao

et al. 2020

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators

Liu

Gao

Zhao

et al. 2021

Preprint

View full text Add to dashboard Cite

This paper presents a novel pre-trained language models (PLM) compression approach based on the matrix product operator (short as MPO) from quantum many-body physics. It can decompose an original matrix into central tensors (containing the core information) and auxiliary tensors (with only a small proportion of parameters). With the decomposed MPO structure, we propose a novel fine-tuning strategy by only updating the parameters from the auxiliary tensors, and design an optimization algorithm for MPO-based approximation over stacked network architectures. Our approach can be applied to the original or the compressed PLMs in a general way, which derives a lighter network and significantly reduces the parameters to be fine-tuned. Extensive experiments have demonstrated the effectiveness of the proposed approach in model compression, especially the reduction in finetuning parameters (91% reduction on average). The code to reproduce the results of this paper can be found at https://github.com/ RUCAIBox/MPOP.

show abstract

Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators

Liu¹,

Gao²,

Zhao³

et al. 2021

View full text Add to dashboard Cite

show abstract

Compressing LSTM Networks by Matrix Product Operators

Gao¹,

Sun²,

Lan³

et al. 2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ze-Feng Gao

Compressing deep neural networks by matrix product operators

A Model Compression Method With Matrix Product Operators for Speech Enhancement

Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators

Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators

Compressing LSTM Networks by Matrix Product Operators

Contact Info

Product

Resources

About