Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020
DOI: 10.18653/v1/2020.emnlp-main.724
|View full text |Cite
|
Sign up to set email alerts
|

Text Classification Using Label Names Only: A Language Model Self-Training Approach

Abstract: Current text classification methods typically require a good number of human-labeled documents as training data, which can be costly and difficult to obtain in real applications. Humans can perform classification without seeing any labeled examples but only based on a small set of words describing the categories to be classified. In this paper, we explore the potential of only using the label name of each class to train classification models on unlabeled data, without using any labeled documents. We use pre-tr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
179
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 158 publications
(180 citation statements)
references
References 37 publications
1
179
0
Order By: Relevance
“…We also compare with LOTClass (Meng et al, 2020b), which works under the extremely weak supervision setting. In their experiments, it mostly relies on class names but has used a few keywords Table 1: An overview of our 7 benchmark datasets.…”
Section: Compared Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…We also compare with LOTClass (Meng et al, 2020b), which works under the extremely weak supervision setting. In their experiments, it mostly relies on class names but has used a few keywords Table 1: An overview of our 7 benchmark datasets.…”
Section: Compared Methodsmentioning
confidence: 99%
“…Compared with previous works Meng et al, 2020b), our X-Class has a significantly more mild requirement on human-provided class names in terms of quantity and quality. We have conducted an experiment in Table 4 for X-Class on 20News and NYT-Small by deleting all but one occurrence of a class name from the input corpus.…”
Section: Requirements On Class Namesmentioning
confidence: 99%
See 3 more Smart Citations