12th ACM Conference on Web Science 2020
DOI: 10.1145/3394231.3397892
|View full text |Cite
|
Sign up to set email alerts
|

Semi-Supervised Granular Classification Framework for Resource Constrained Short-texts

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 7 publications
0
4
0
Order By: Relevance
“…We combine the two embedding spaces with a novel end-toend scaled dot product cross attention mechanism which learns to attend on corpus level context information given an example, where the downstream task is text classification. In addition, we enable multilinguality in our model by making it applicable to realistic disaster situations while most existing works on disaster response domain are monolingual [1,9,12] only. In § 3.1, we discuss our method for obtaining example level contextual word embeddings with recent transformer based models (e.g BERT [8] or XLM-R [5]).…”
Section: Proposed Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…We combine the two embedding spaces with a novel end-toend scaled dot product cross attention mechanism which learns to attend on corpus level context information given an example, where the downstream task is text classification. In addition, we enable multilinguality in our model by making it applicable to realistic disaster situations while most existing works on disaster response domain are monolingual [1,9,12] only. In § 3.1, we discuss our method for obtaining example level contextual word embeddings with recent transformer based models (e.g BERT [8] or XLM-R [5]).…”
Section: Proposed Frameworkmentioning
confidence: 99%
“…However, these user-generated contents, such as tweets, are generally in languages native to the location of the disaster. On the other hand, majority of works in the literature focus mostly on English language [1,3,9,12,17,21,22] only. Understanding and processing texts in multiple languages is of paramount importance for effective disaster mitigation.…”
Section: Introductionmentioning
confidence: 99%
“…Although existing works utilize such information to build models for crisis event analysis, standard supervised approaches require annotating vast amounts of data during disasters, which is impractical due to limited response time (Li et al, 2015;Caragea et al, 2016;Li et al, 2017Neppalli et al, 2018;Ray Chowdhury et al, 2020;Sosea et al, 2021). On the other hand, current semi-supervised models can be biased, performing moderately well for certain classes while extremely worse for others, resulting in a detrimentally negative effect on disaster monitoring and analysis (Alam et al, 2018;Ghosh and Desarkar, 2020;Sirbu et al, 2022;Zou et al, 2023;Wang et al, 2023a). For instance, neglecting life-essential classes, such as requests or urgent needs, displaced people & evacuations and injured or dead people, can have severely adverse consequences for relief efforts.…”
Section: Introductionmentioning
confidence: 99%
“…One primary reason for this is the difficulty in obtaining a sufficiently large number of annotations on time to assist development of a classifier. The problem has been explored in literature [1], [4] with the assumption that sufficient amount of labeled data is available. However, this assumption might not be realistic during a disaster.…”
Section: Introductionmentioning
confidence: 99%