Hui-Syuan Yeh scite author profile

Hui-Syuan Yeh

5Publications

13Citation Statements Received

61Citation Statements Given

How they've been cited

How they cite others

Affiliations

Saarland University

Publications

Order By: Most citations

On Training Instance Selection for Few-Shot Neural Text Generation

Chang¹,

Shen²,

Yeh³

et al. 2021

View full text Add to dashboard Cite

Large-scale pretrained language models have led to dramatic improvements in text generation. Impressive performance can be achieved by finetuning only on a small number of instances (few-shot setting). Nonetheless, almost all previous work simply applies random sampling to select the few-shot training instances. Little to no attention has been paid to the selection strategies and how they would affect model performance. In this work, we present a study on training instance selection in few-shot neural text generation. The selection decision is made based only on the unlabeled data so as to identify the most worthwhile data points that should be annotated under some budget of labeling cost. Based on the intuition that the few-shot training instances should be diverse and representative of the entire data distribution, we propose a simple selection strategy with K-means clustering. We show that even with the naive clustering-based approach, the generation models consistently outperform random sampling on three text generation tasks: data-to-text generation, document summarization and question generation. The code and training data are made available at https://gitlab.com/erniecyc/ few-selector. We hope that this work will call for more attention on this largely unexplored area. * Equal contribution. X.shen is now at Amazon Alexa AI.

show abstract

Does the Order of Training Samples Matter? Improving Neural Data-to-Text Generation with Curriculum Learning

Chang

Yeh

Demberg

2021

View full text Add to dashboard Cite

Recent advancements in data-to-text generation largely take on the form of neural end-toend systems. Efforts have been dedicated to improving text generation systems by changing the order of training samples in a process known as curriculum learning. Past research on sequence-to-sequence learning showed that curriculum learning helps to improve both the performance and convergence speed. In this work, we delve into the same idea surrounding the training samples consisting of structured data and text pairs, where at each update, the curriculum framework selects training samples based on the model's competence. Specifically, we experiment with various difficulty metrics and put forward a soft edit distance metric for ranking training samples. Our benchmarks show faster convergence speed where training time is reduced by 38.7% and performance is boosted by 4.84 BLEU.

show abstract

Decorate the Examples: A Simple Method of Prompt Design for Biomedical Relation Extraction

Yeh¹,

Lee²,

Zweigenbaum³

2022

Preprint

View full text Add to dashboard Cite

Relation extraction is a core problem for natural language processing in the biomedical domain. Recent research on relation extraction showed that prompt-based learning improves the performance on both fine-tuning on full training set and few-shot training. However, less effort has been made on domain-specific tasks where good prompt design can be even harder. In this paper, we investigate prompting for biomedical relation extraction, with experiments on the ChemProt dataset. We present a simple yet effective method to systematically generate comprehensive prompts that reformulate the relation extraction task as a cloze-test task under a simple prompt formulation.In particular, we experiment with different ranking scores for prompt selection. With BioMed-RoBERTa-base, our results show that prompting-based fine-tuning obtains gains by 14.21 F1 over its regular fine-tuning baseline, and 1.14 F1 over SciFive-Large, the current state-of-the-art on ChemProt. Besides, we find prompt-based learning requires fewer training examples to make reasonable predictions. The results demonstrate the potential of our methods in such a domain-specific relation extraction task.

show abstract

Does the Order of Training Samples Matter? Improving Neural Data-to-Text Generation with Curriculum Learning

Chang

Yeh

Demberg

2021

Preprint

View full text Add to dashboard Cite

Time-Aware Ancient Chinese Text Translation and Inference

Chang¹,

Shiue²,

Yeh³

et al. 2021

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hui-Syuan Yeh

On Training Instance Selection for Few-Shot Neural Text Generation

Does the Order of Training Samples Matter? Improving Neural Data-to-Text Generation with Curriculum Learning

Decorate the Examples: A Simple Method of Prompt Design for Biomedical Relation Extraction

Does the Order of Training Samples Matter? Improving Neural Data-to-Text Generation with Curriculum Learning

Time-Aware Ancient Chinese Text Translation and Inference

Contact Info

Product

Resources

About