Untitled

Mishra, Pushkar; Tredici, Marco Del; Yannakoudakis, Helen; Shutova, Ekaterina

doi:10.18653/v1/n19-1221

Cited by 20 publications

(8 citation statements)

References 9 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this paper, we follow and extend the work of Reference [3][4][5] on anti-refugee and anti-migrant hate speech detection. We apply hate speech detection to Greek and enrich this with a multimodal approach, in order to take into account hateful content that does not necessarily carry textual streams.…”

Section: Introductionmentioning

confidence: 94%

“…They apply a series of models and report best precision, recall, and f1-score of 0.91, 0.90, and 0.90, respectively. Mishra et al [5] use graph convolutional networks to attack the problem, utilising social graph information as part of the model. Waseem and Hovy focus on data collection [3] and on linguistic features that improve quality, while Waseem [4] provides a list of criteria for the annotation process of hate speech.…”

Section: Hate Speech Detection As a Text Classification Problemmentioning

confidence: 99%

“…While there have been some publicly available datasets of labelled hateful tweets, in the form of tweet IDs (References [3,4,26]), as Mishra observes [5], many of these tweets have been deleted and abusive users have been suspended due to violations of Twitter's policy. Thus, the available datasets are no longer valid for baselines and comparisons and can only be partially used as additional data for model training.…”

Section: Datasetmentioning

confidence: 99%

See 2 more Smart Citations

Multimodal Hate Speech Detection in Greek Social Media

Perifanos

Goutsos

2021

MTI

View full text Add to dashboard Cite

Hateful and abusive speech presents a major challenge for all online social media platforms. Recent advances in Natural Language Processing and Natural Language Understanding allow for more accurate detection of hate speech in textual streams. This study presents a new multimodal approach to hate speech detection by combining Computer Vision and Natural Language processing models for abusive context detection. Our study focuses on Twitter messages and, more specifically, on hateful, xenophobic, and racist speech in Greek aimed at refugees and migrants. In our approach, we combine transfer learning and fine-tuning of Bidirectional Encoder Representations from Transformers (BERT) and Residual Neural Networks (Resnet). Our contribution includes the development of a new dataset for hate speech classification, consisting of tweet IDs, along with the code to obtain their visual appearance, as they would have been rendered in a web browser. We have also released a pre-trained Language Model trained on Greek tweets, which has been used in our experiments. We report a consistently high level of accuracy (accuracy score = 0.970, f1-score = 0.947 in our best model) in racist and xenophobic speech detection.

show abstract

Section: Introductionmentioning

confidence: 94%

Section: Hate Speech Detection As a Text Classification Problemmentioning

confidence: 99%

Section: Datasetmentioning

confidence: 99%

See 1 more Smart Citation

Multimodal Hate Speech Detection in Greek Social Media

Perifanos

Goutsos

2021

MTI

View full text Add to dashboard Cite

show abstract

“…Recently, approaches to abuse detection have moved towards more complex models that utilize auxiliary knowledge in addition to the abuse-annotated data. For instance, Mishra et al (2018aMishra et al ( , 2019a) used community-based author information as features in their classifiers with promising results. Founta et al (2019) used transfer learning to fine-tune features from the author metadata network to improve abuse detection.…”

Section: Related Workmentioning

confidence: 99%

“…The NLP community has experimented with a range of techniques for abuse detection, such as recurrent and convolutional neural networks (Pavlopoulos et al, 2017;Park and Fung, 2017;Wang, 2018), character-based models (Nobata et al, 2016) and graph-based learning methods (Mishra et al, 2018a;Aglionby et al, 2019;Mishra et al, 2019a), obtaining promising results. However, all of the existing approaches have focused on modelling the linguistic properties of the comments or the meta-data about the users.…”

Section: Introductionmentioning

confidence: 99%

Joint Modelling of Emotion and Abusive Language Detection

Rajamanickam¹,

Mishra²,

Yannakoudakis

et al. 2020

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Self Cite

View full text Add to dashboard Cite

The rise of online communication platforms has been accompanied by some undesirable effects, such as the proliferation of aggressive and abusive behaviour online. Aiming to tackle this problem, the natural language processing (NLP) community has experimented with a range of techniques for abuse detection. While achieving substantial success, these methods have so far only focused on modelling the linguistic properties of the comments and the online communities of users, disregarding the emotional state of the users and how this might affect their language. The latter is, however, inextricably linked to abusive behaviour. In this paper, we present the first joint model of emotion and abusive language detection, experimenting in a multi-task learning framework that allows one task to inform the other. Our results demonstrate that incorporating affective features leads to significant improvements in abuse detection performance across datasets.

show abstract