Proceedings of the Tenth International Workshop on Natural Language Processing for Social Media 2022
DOI: 10.18653/v1/2022.socialnlp-1.5
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study on Word Embeddings and Social NLP Tasks

Abstract: In recent years, grey social media platforms, those with a loose moderation policy on cyberbullying, have been attracting more users. Recently, data collected from these types of platforms have been used to pre-train word embeddings (social-media-based), yet these word embeddings have not been investigated for social NLP related tasks. In this paper, we carried out a comparative study between social-mediabased and non-social-media-based word embeddings on two social NLP tasks: Detecting cyberbullying and Measu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…Then, I investigate the performance of Bi-LSTM model with an un-trainable embeddings layer of the five word embeddings on the used five hate-speechrelated datasets. The results indicate that the word embeddings that are pre-trained on biased datasets social-media-based, outperform the other word embeddings that are trained on informational data, informational-based on the tasks of offenses categorization and hate speech detection (Elsafoury et al, 2022b).…”
Section: The Explainability Perspectivementioning
confidence: 96%
See 3 more Smart Citations
“…Then, I investigate the performance of Bi-LSTM model with an un-trainable embeddings layer of the five word embeddings on the used five hate-speechrelated datasets. The results indicate that the word embeddings that are pre-trained on biased datasets social-media-based, outperform the other word embeddings that are trained on informational data, informational-based on the tasks of offenses categorization and hate speech detection (Elsafoury et al, 2022b).…”
Section: The Explainability Perspectivementioning
confidence: 96%
“…In (Elsafoury et al, 2022a;Elsafoury, 2023), I investigate how the hateful content on social media and other platforms that are used to collect data and pre-train NLP models, is being encoded by those NLP models to form systematic offensive stereotyping (SOS) bias against marginalized groups of people. Especially with imbalanced representation and co-occurrence of the hateful content with the marginalized identity groups.…”
Section: The Offensive Stereotyping Bias Perspectivementioning
confidence: 99%
See 2 more Smart Citations
“…(Elsafoury, 2022). I propose a method to measure it and validate it in static (Elsafoury et al, 2022a) and contextual word embeddings (Elsafoury et al, 2022a). Finally, I study how it impacts the performance of these word embeddings on hate speech detection models.…”
Section: Contributionsmentioning
confidence: 99%