Sam Witteveen scite author profile

Sam Witteveen

5Publications

45Citation Statements Received

41Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Paraphrasing with Large Language Models

Witteveen¹,

Andrews²

2019

View full text Add to dashboard Cite

Recently, large language models such as GPT-2 have shown themselves to be extremely adept at text generation and have also been able to achieve high-quality results in many downstream NLP tasks such as text classification, sentiment analysis and question answering with the aid of fine-tuning. We present a useful technique for using a large language model to perform the task of paraphrasing on a variety of texts and subjects. Our approach is demonstrated to be capable of generating paraphrases not only at a sentence level but also for longer spans of text such as paragraphs without needing to break the text into smaller chunks.

show abstract

Red Dragon AI at TextGraphs 2019 Shared Task: Language Model Assisted Explanation Generation

Chia¹,

Witteveen²,

Andrews³

2019

View full text Add to dashboard Cite

The TextGraphs-13 Shared Task on Explanation Regeneration (Jansen and Ustalov, 2019) asked participants to develop methods to reconstruct gold explanations for elementary science questions. Red Dragon AI's entries used the language of the questions and explanation text directly, rather than a constructing a separate graph-like representation. Our leaderboard submission placed us 3 rd in the competition, but we present here three methods of increasing sophistication, each of which scored successively higher on the test set after the competition close.

show abstract

Unsupervised Natural Question Answering with a Small Model

Andrews¹,

Witteveen²

2019

View full text Add to dashboard Cite

The recent demonstration of the power of huge language models such as GPT-2 to memorise the answers to factoid questions raises questions about the extent to which knowledge is being embedded directly within these large models. This short paper describes an architecture through which much smaller models can also answer such questions -by making use of 'raw' external knowledge. The contribution of this work is that the methods presented here rely on unsupervised learning techniques, complementing the unsupervised training of the Language Model. The goal of this line of research is to be able to add knowledge explicitly, without extensive training.

show abstract

Red Dragon AI at TextGraphs 2020 Shared Task : LIT : LSTM-Interleaved Transformer for Multi-Hop Explanation Ranking

Chia¹,

Witteveen²,

Andrews³

2020

View full text Add to dashboard Cite

Explainable question answering for science questions is a challenging task that requires multihop inference over a large set of fact sentences. To counter the limitations of methods that view each query-document pair in isolation, we propose the LSTM-Interleaved Transformer which incorporates cross-document interactions for improved multi-hop ranking. The LIT architecture can leverage prior ranking positions in the re-ranking setting. Our model is competitive on the current leaderboard for the TextGraphs 2020 shared task, achieving a test-set MAP of 0.5607, and would have gained third place had we submitted before the competition deadline. Our code implementation is made available at https://github.com/mdda/worldtree_corpus/ tree/textgraphs_2020

show abstract

Transformer to CNN: Label-scarce distillation for efficient text classification

Chia¹,

Witteveen²,

Andrews³

2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sam Witteveen

Paraphrasing with Large Language Models

Red Dragon AI at TextGraphs 2019 Shared Task: Language Model Assisted Explanation Generation

Unsupervised Natural Question Answering with a Small Model

Red Dragon AI at TextGraphs 2020 Shared Task : LIT : LSTM-Interleaved Transformer for Multi-Hop Explanation Ranking

Transformer to CNN: Label-scarce distillation for efficient text classification

Contact Info

Product

Resources

About