Constructing Artificial Data for Fine-tuning for Low-Resource Biomedical Text Tagging with Applications in PICO Annotation

Singh, Gaurav; Sabet, Zahra; Shawe‐Taylor, John; Thomas, James D.

doi:10.48550/arxiv.1910.09255

Search citation statements

Order By: Relevance

Paper Sections

Select...

Our Qualitative Analysis Provides Insights Into1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2020

Publication Types

Select...

Other1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We broadly categorise existing approaches based on their modification method into input-related, external and internal. Input modifications Singh et al, 2020;Lai et al, 2020;Ruan et al, 2020) adapt the information that is fed to BERT -e.g. feeding text triples separated by [SEP] tokens instead of sentence pairs as in Lai et al (2020) -while leaving the architecture unchanged.…”

Section: Our Qualitative Analysis Provides Insights Intomentioning

confidence: 99%

GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method

Peinelt,

Rei,

Liakata

2020

Preprint

View full text Add to dashboard Cite

Large pre-trained language models such as BERT have been the driving force behind recent improvements across many NLP tasks. However, BERT is only trained to predict missing words -either behind masks or in the next sentence -and has no knowledge of lexical, syntactic or semantic information beyond what it picks up through unsupervised pre-training. We propose a novel method to explicitly inject linguistic knowledge in the form of word embeddings into any layer of a pre-trained BERT. Our performance improvements on multiple semantic similarity datasets when injecting dependency-based and counter-fitted embeddings indicate that such information is beneficial and currently missing from the original model. Our qualitative analysis shows that counter-fitted embedding injection particularly helps with cases involving synonym pairs.

show abstract

Section: Our Qualitative Analysis Provides Insights Intomentioning

confidence: 99%