Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach

Yin, Wenpeng; Hay, Jamaal; Roth, Dan

doi:10.18653/v1/d19-1404

Cited by 316 publications

(306 citation statements)

References 15 publications

Supporting

Mentioning

303

Contrasting


“…Early attempts rely on distant supervision such as Wikipedia to interpret the label name semantics and derive document-concept relevance via explicit semantic analysis (Gabrilovich and Markovitch, 2007). Since the classifier is learned purely from general knowledge without even requiring any unlabeled domain-specific data, these methods are called dataless classification (Chang et al, 2008; Song and Roth, 2014; Yin et al, 2019). Later, topic models (Chen et al, 2015; Li et al, 2016) are exploited for seed-guided classification to learn seed word-aware topics by biasing the Dirichlet priors and to infer posterior document-topic assignment.…”

Section: Weakly-supervised Text Classification (mentioning)

confidence: 99%

Text Classification Using Label Names Only: A Language Model Self-Training Approach

Yu, Zhang, Huang et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)


Current text classification methods typically require a good number of human-labeled documents as training data, which can be costly and difficult to obtain in real applications. Humans can perform classification without seeing any labeled examples but only based on a small set of words describing the categories to be classified. In this paper, we explore the potential of only using the label name of each class to train classification models on unlabeled data, without using any labeled documents. We use pre-trained neural language models both as general linguistic knowledge sources for category understanding and as representation learning models for document classification. Our method (1) associates semantically related words with the label names, (2) finds category-indicative words and trains the model to predict their implied categories, and (3) generalizes the model via self-training. We show that our model achieves around 90% accuracy on four benchmark datasets including topic and sentiment classification without using any labeled documents but learning from unlabeled data supervised by at most 3 words (1 in most cases) per class as the label name.


“…Recent work in NLP has shown significant progress in learning tasks from examples. Large pretrained language models have dramatically improved performance on standard benchmarks (Peters et al, 2018; Devlin et al, 2019; Raffel et al, 2019) and have shown promising results in zero shot prediction by leveraging their language understanding capabilities (Levy et al, 2017; Zhou et al, 2018; Yin et al, 2019).…”

Section: Introduction (mentioning)

confidence: 99%

Learning from Task Descriptions

Weller, Lourie, Gardner et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)


Typically, machine learning systems solve new tasks by training on thousands of examples. In contrast, humans can solve new tasks by reading some instructions, with perhaps an example or two. To take a step toward closing this gap, we introduce a framework for developing NLP systems that solve new tasks after reading their descriptions, synthesizing prior work in this area. We instantiate this framework with a new English language dataset, ZEST, structured for task-oriented evaluation on unseen tasks. Formulating task descriptions as questions, we ensure each is general enough to apply to many possible inputs, thus comprehensively evaluating a model's ability to solve each task. Moreover, the dataset's structure tests specific types of systematic generalization. We find that the state-of-the-art T5 model achieves a score of 12% on ZEST, leaving a significant challenge for NLP researchers.


“…This coreference example shows that transforming an NLP task as textual entailment may obtain surprising advantages. There are more NLP tasks that can fit the entailment framework easily, such as text classification (Yin et al, 2019), relation extraction, summarization, etc. However, we also need to admit that reformulating into entailment may also need to fight against new challenges.…”

Section: Results and Analyses (mentioning)

confidence: 99%

Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start

Yin, Rajani, Radev et al. 2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Self Cite


A standard way to address different NLP problems is by first constructing a problem-specific dataset, then building a model to fit this dataset. To build the ultimate artificial intelligence, we desire a single machine that can handle diverse new problems, for which task-specific annotations are limited. We bring up textual entailment as a unified solver for such NLP problems. However, current research of textual entailment has not spilled much ink on the following questions: (i) How well does a pretrained textual entailment system generalize across domains with only a handful of domain-specific examples? and (ii) When is it worth transforming an NLP task into textual entailment? We argue that the transforming is unnecessary if we can obtain rich annotations for this task. Textual entailment really matters particularly when the target NLP task has insufficient annotations. Universal NLP can be probably achieved through different routines. In this work, we introduce Universal Few-shot textual Entailment (UFO-ENTAIL). We demonstrate that this framework enables a pretrained entailment model to work well on new entailment domains in a few-shot setting, and show its effectiveness as a unified solver for several downstream NLP tasks such as question answering and coreference resolution when the end-task annotations are limited.
