Label scarcity is a bottleneck for improving task performance in specialised domains. We propose a novel compositional transfer learning framework (DOT5¹) for zero-shot domain transfer. Without access to in-domain labels, DOT5 jointly learns domain knowledge (from masked language modelling of unlabelled in-domain free text) and task knowledge (from task training on more readily available general-domain data) in a multi-task manner. To improve the transferability of task training, we design a strategy named NLGU: we simultaneously train natural language generation (NLG) for in-domain label-to-data generation, which enables data augmentation for self-finetuning, and natural language understanding (NLU) for label prediction. We evaluate DOT5 on the biomedical domain and the resource-lean subdomain of radiology, focusing on natural language inference, text summarisation and embedding learning. DOT5 demonstrates the effectiveness of compositional transfer learning through multi-task learning. In particular, DOT5 outperforms the current state-of-the-art in zero-shot transfer by over 7 absolute points in accuracy on RadNLI. We validate DOT5 with ablations and a case study demonstrating its ability to solve challenging NLI examples requiring in-domain expertise.

* Work done at Microsoft Health Futures.
¹ DOT5 (read as "dot five"): Domain Compositional ZerOshot T5.
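The abstract combines three training signals: NLU (data-to-label), NLG (label-to-data) on labelled general-domain data, and masked language modelling on unlabelled in-domain text. As a minimal sketch only, the Python below illustrates how such signals could be cast as T5-style text-to-text (input, target) pairs and mixed into one multi-task stream; the prompt wordings, function names, and the simplified span corruption are illustrative assumptions, not the paper's actual formats.

```python
import random

def nlu_example(premise: str, hypothesis: str, label: str) -> tuple[str, str]:
    # NLU direction: read a sentence pair, predict the label.
    return f"nli premise: {premise} hypothesis: {hypothesis}", label

def nlg_example(premise: str, hypothesis: str, label: str) -> tuple[str, str]:
    # NLG direction: given a premise and a requested label, generate the
    # hypothesis. Run over unlabelled in-domain premises, this direction
    # can synthesise labelled examples for self-finetuning.
    return f"nli generate {label} hypothesis. premise: {premise}", hypothesis

def mlm_example(text: str, mask_prob: float = 0.15) -> tuple[str, str]:
    # Simplified T5-style span corruption on unlabelled in-domain text:
    # masked spans become sentinel tokens in the input; the target lists
    # each sentinel followed by the tokens it replaced.
    tokens, inp, tgt, sid, i = text.split(), [], [], 0, 0
    while i < len(tokens):
        if random.random() < mask_prob:
            span = [tokens[i]]
            i += 1
            while i < len(tokens) and len(span) < 3 and random.random() < 0.5:
                span.append(tokens[i])
                i += 1
            inp.append(f"<extra_id_{sid}>")
            tgt.append(f"<extra_id_{sid}> " + " ".join(span))
            sid += 1
        else:
            inp.append(tokens[i])
            i += 1
    tgt.append(f"<extra_id_{sid}>")
    return " ".join(inp), " ".join(tgt)

def mixed_batch(general_nli, in_domain_docs):
    # Multi-task mixture: task knowledge from labelled general-domain NLI
    # (both NLU and NLG directions) plus domain knowledge from MLM on
    # unlabelled in-domain free text.
    batch = [ex for p, h, y in general_nli
             for ex in (nlu_example(p, h, y), nlg_example(p, h, y))]
    batch += [mlm_example(doc) for doc in in_domain_docs]
    random.shuffle(batch)
    return batch

if __name__ == "__main__":
    nli = [("The lungs are clear.", "There is no consolidation.", "entailment")]
    docs = ["Heart size is normal and the mediastinal contours are unremarkable."]
    for src, tgt in mixed_batch(nli, docs):
        print(f"IN : {src}\nOUT: {tgt}\n")
```

Under this reading of the abstract, the self-finetuning loop would use the NLG direction to generate in-domain hypotheses for each requested label, then train the NLU direction on those synthetic pairs; the exact filtering and mixing choices are details the abstract does not specify.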