Yufei Huang scite author profile

Yufei Huang

5Publications

28Citation Statements Received

50Citation Statements Given

How they've been cited

How they cite others

114

Affiliations

Rutgers, The State University of New Jersey

Publications

Order By: Most citations

TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference

Ye¹,

Lin²,

Huang³

et al. 2021

View full text Add to dashboard Cite

Existing pre-trained language models (PLMs) are often computationally expensive in inference, making them impractical in various resource-limited real-world applications. To address this issue, we propose a dynamic token reduction approach to accelerate PLMs' inference, named TR-BERT, which could flexibly adapt the layer number of each token in inference to avoid redundant calculation. Specially, TR-BERT formulates the token reduction process as a multi-step token selection problem and automatically learns the selection strategy via reinforcement learning. The experimental results on several downstream NLP tasks show that TR-BERT is able to speed up BERT by 2-5 times to satisfy various performance demands. Moreover, TR-BERT can also achieve better performance with less computation in a suite of long-text tasks since its token-level layer number adaption greatly accelerates the self-attention operation in PLMs. The source code and experiment details of this paper can be obtained from https://github.com/ thunlp/TR-BERT.

show abstract

TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference

Ye¹,

Lin²,

Huang³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

A Distributed Multi-Agent Reinforcement Learning With Graph Decomposition Approach for Large-Scale Adaptive Traffic Signal Control

Jiang

Huang

Jafari

et al. 2022

IEEE Trans. Intell. Transport. Syst.

View full text Add to dashboard Cite

Driving Safety Prediction and Safe Route Mapping Using In-Vehicle and Roadside Data

Huang

Jafari²,

Jin³

2022

SSRN Journal

View full text Add to dashboard Cite

Analysis of the Optimal Adjustment Frequency for a Dual-Axis Solar Tracker

Gibson

Ibrahim

Lihn

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yufei Huang

TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference

TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference

A Distributed Multi-Agent Reinforcement Learning With Graph Decomposition Approach for Large-Scale Adaptive Traffic Signal Control

Driving Safety Prediction and Safe Route Mapping Using In-Vehicle and Roadside Data

Analysis of the Optimal Adjustment Frequency for a Dual-Axis Solar Tracker

Contact Info

Product

Resources

About