Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021
DOI: 10.18653/v1/2021.emnlp-main.46
|View full text |Cite
|
Sign up to set email alerts
|

Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data

Abstract: Existing text classification methods mainly focus on a fixed label set, whereas many realworld applications require extending to new fine-grained classes as the number of samples per label increases. To accommodate such requirements, we introduce a new problem called coarse-to-fine grained classification, which aims to perform fine-grained classification on coarsely annotated data. Instead of asking for new fine-grained human annotations, we opt to leverage label surface names as the only human guidance and we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 16 publications
0
10
0
Order By: Relevance
“…These results confirm that CO is a strong heuristic for the task of finegrained semantic indexing, outperforming again dictionary-based heuristics. In addition, the weak labelling enhancement approach (ALO 3 ) achieved an overall maF 1 about 5pp higher than CO alone, which is a statistically significant difference 35 . The performance of LM 3 was not significantly lower than that of ALO 3 , as the two approaches achieved practically the same F 1 score for most of the 70 labels.…”
Section: Evaluation On New Labels Introduced After 2006mentioning
confidence: 86%
See 3 more Smart Citations
“…These results confirm that CO is a strong heuristic for the task of finegrained semantic indexing, outperforming again dictionary-based heuristics. In addition, the weak labelling enhancement approach (ALO 3 ) achieved an overall maF 1 about 5pp higher than CO alone, which is a statistically significant difference 35 . The performance of LM 3 was not significantly lower than that of ALO 3 , as the two approaches achieved practically the same F 1 score for most of the 70 labels.…”
Section: Evaluation On New Labels Introduced After 2006mentioning
confidence: 86%
“…In contrast to the broader problem of hierarchical text classification, coarsegrained class labels are already available in the biomedical indexing problem that we study. Recently, Mekala et al [35] focused on this special case of fine-grained indexing, where coarse-grained labels are available, proposing the Coarse2Fine (C2F ) method. C2F relies on the literal occurrence of fine-grained labels to create an initial weakly labeled dataset.…”
Section: Fine-grained Classification Without Labelled Datamentioning
confidence: 99%
See 2 more Smart Citations
“…Recent advances in NLP demonstrate exceptional capabilities of generative language models in generating text (Radford and Narasimhan, 2018;Radford et al, 2019;Lewis et al, 2020;Brown et al, 2020). Prior work (Kumar et al, 2020;Anaby-Tavor et al, 2020;Mekala et al, 2021)…”
Section: Introductionmentioning
confidence: 99%