Fine-tuned pre-trained language models can suffer from severe miscalibration for both in-distribution and out-of-distribution (OOD) data due to over-parameterization. To mitigate this issue, we propose a regularized fine-tuning method. Our method introduces two types of regularization for better calibration: (1) On-manifold regularization, which generates pseudo on-manifold samples through interpolation within the data manifold. Augmented training with these pseudo samples imposes a smoothness regularization to improve in-distribution calibration. (2) Off-manifold regularization, which encourages the model to output uniform distributions for pseudo off-manifold samples to address the over-confidence issue for OOD data. Our experiments demonstrate that the proposed method outperforms existing calibration methods for text classification in terms of expected calibration error, misclassification detection, and OOD detection on six datasets. Our code can be found at https://github.com/Lingkai-Kong/Calibrated-BERT-Fine-Tuning.
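A minimal PyTorch sketch of what the two regularizers described in this abstract could look like. The `model.encode` and `model.classify` interfaces, the mixup-style interpolation of hidden states, and the Gaussian perturbation used to create pseudo off-manifold samples are all illustrative assumptions, not the paper's exact construction:

```python
import torch
import torch.nn.functional as F

def on_manifold_loss(model, x_a, x_b, y_a, y_b, lam):
    """Sketch of on-manifold regularization: interpolate the hidden
    representations of two inputs (mixup-style) and train the classifier
    on the correspondingly mixed labels to encourage smooth predictions."""
    h_a = model.encode(x_a)          # assumed encoder interface
    h_b = model.encode(x_b)
    h_mix = lam * h_a + (1 - lam) * h_b
    logits = model.classify(h_mix)   # assumed classifier head
    return lam * F.cross_entropy(logits, y_a) + (1 - lam) * F.cross_entropy(logits, y_b)

def off_manifold_loss(model, x, eps=1e-2):
    """Sketch of off-manifold regularization: perturb embeddings away from
    the data manifold (here with Gaussian noise, as a stand-in) and push
    the predictive distribution toward uniform to curb OOD over-confidence."""
    h = model.encode(x)
    h_off = h + eps * torch.randn_like(h)            # pseudo off-manifold sample
    log_probs = F.log_softmax(model.classify(h_off), dim=-1)
    uniform = torch.full_like(log_probs, 1.0 / log_probs.size(-1))
    return F.kl_div(log_probs, uniform, reduction="batchmean")
```

In training, both terms would be added to the standard cross-entropy objective with tunable weights.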
As semiconductor feature sizes continue to shrink, the community is looking toward "More Moore" and "More than Moore" technologies. To offer a possible alternative implementation process, researchers are exploring a feasible transition from silicon to molecular computing. Such a transition relies on programming bio-based modules with computer-like logic, with the ultimate aim of realizing a Turing machine. To accomplish this, DNA-based combinational logic is inevitably the first step to address. This timely overview paper introduces combinational logic synthesis in DNA computing from both analog and digital perspectives. State-of-the-art research progress is summarized to help interested readers quickly understand DNA computing, initiate discussion of existing techniques, and inspire innovative solutions. We hope this paper can pave the way for future DNA computing synthesis.
With the development of large language models (LLMs), zero-shot learning has attracted much attention for various NLP tasks. Unlike prior works that generate training data with billion-scale natural language generation (NLG) models, we propose a retrieval-enhanced framework to create training data from a general-domain unlabeled corpus. To realize this, we first conduct contrastive pretraining to learn an unsupervised dense retriever for extracting the most relevant documents using class-descriptive verbalizers. We then further propose two simple strategies, namely Verbalizer Augmentation with Demonstrations and Self-consistency Guided Filtering, to improve the topic coverage of the dataset while removing noisy examples. Experiments on nine datasets demonstrate that REGEN achieves a 4.3% gain over the strongest baselines and saves around 70% of the time compared with baselines using large NLG models. Besides, REGEN can be naturally integrated with recently proposed large language models to boost performance.
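A rough sketch of the retrieval step this abstract describes: given embeddings from a contrastively pre-trained dual encoder (assumed here), documents most similar to each class-descriptive verbalizer are pulled from the unlabeled corpus and labeled with that class. The function name, the cosine-similarity scoring, and the `top_k` cutoff are illustrative assumptions; the verbalizer augmentation and self-consistency filtering steps are omitted:

```python
import numpy as np

def retrieve_pseudo_training_data(corpus_embeddings, corpus_texts,
                                  verbalizer_embeddings, top_k=100):
    """For each class verbalizer embedding, retrieve the top_k most similar
    corpus documents by cosine similarity and label them with that class."""
    dataset = []
    for label, v_emb in enumerate(verbalizer_embeddings):
        sims = corpus_embeddings @ v_emb / (
            np.linalg.norm(corpus_embeddings, axis=1) * np.linalg.norm(v_emb) + 1e-8
        )
        top_idx = np.argsort(-sims)[:top_k]          # highest-similarity documents
        dataset.extend((corpus_texts[i], label) for i in top_idx)
    return dataset
```

The resulting (text, label) pairs would then be filtered for noise before being used to train the downstream classifier.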