Chufan Wu scite author profile

Chufan Wu

2 Publications

60 Citation Statements Received

11 Citation Statements Given

Publications

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Mao¹, Wang², Wu³ et al. 2020

BERT is a cutting-edge language representation model pre-trained on a large corpus, and it achieves superior performance on various natural language understanding tasks. However, a major obstacle to applying BERT in online services is that it is memory-intensive and leads to unsatisfactory latency for user requests. Existing solutions leverage knowledge distillation frameworks to learn smaller models that imitate the behavior of BERT. However, the training procedure of knowledge distillation is itself expensive, as it requires sufficient training data to imitate the teacher model. In this paper, we address this issue by proposing a hybrid solution named LadaBERT (Lightweight adaptation of BERT through hybrid model compression), which combines the advantages of different model compression methods, including weight pruning, matrix factorization and knowledge distillation. LadaBERT achieves state-of-the-art accuracy on various public datasets while reducing training overheads by an order of magnitude.

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

Mao¹, Wang², Wu³ et al. 2020

Preprint
