Proceedings of the 13th International Workshop on Semantic Evaluation 2019
DOI: 10.18653/v1/s19-2107
|View full text |Cite
|
Sign up to set email alerts
|

Emad at SemEval-2019 Task 6: Offensive Language Identification using Traditional Machine Learning and Deep Learning approaches

Abstract: In this paper, the used methods and the results obtained by our team, entitled Emad, on the OffensEval 2019 shared task organized at Se-mEval 2019 are presented. The OffensEval shared task includes three sub-tasks namely Offensive language identification, Automatic categorization of offense types and Offense target identification. We participated in subtask A and tried various methods including traditional machine learning methods, deep learning methods and also a combination of the first two sets of methods. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 8 publications
0
6
0
Order By: Relevance
“…Among these three approaches, the rule-based black-list was the best with a 0.73 F-score. Kebriaei et al proposed machine learning algorithms, deep learning algorithms and the ensemble of these algorithms, and also, realized data augmentation from different sources [29]. The most successful Fscore (0.76) was obtained with the application of SVM to expanded data.…”
Section: Related Workmentioning
confidence: 99%
“…Among these three approaches, the rule-based black-list was the best with a 0.73 F-score. Kebriaei et al proposed machine learning algorithms, deep learning algorithms and the ensemble of these algorithms, and also, realized data augmentation from different sources [29]. The most successful Fscore (0.76) was obtained with the application of SVM to expanded data.…”
Section: Related Workmentioning
confidence: 99%
“…In [44], they applied dependency based embeddings for word substitution to generate text while leveraging textual membership queries. Some other studies leveraged solutions such as shifting the position of words in a zero padded representation [30], synthetic minority oversampling, random over-and under-sampling, and AdaSYN [29], adding common misspelling of words to data and collecting tweets that contain swear words in conjunction with positive adjectives or racial and religious tweets [38], adding tweets with disgust and anger emotions from suspended accounts to the data [1], bootstrapping from another dataset; embedding based [13] or sentiment polarity based [8] methods.…”
Section: Related Studiesmentioning
confidence: 99%
“…The research area of hate speech detection framed as a text classification task has come a long way in recent years. The outpour of research studies tackle different aspects of the problem and the resulting proposed solutions include the manual extraction of generic and task-specific features and the careful design of machine learning architectures [13,20,21,23,27]. Since most of the existing work have formulated the problem of detecting textual hate speech as a text classification task, it naturally requires access to a large source of clean, bias free, balanced textual data.…”
Section: Introductionmentioning
confidence: 99%
“…Traditional machine learning (ML) approaches such as support vector machines (SVM) by Perelló et al (2019) can be trained on hate speech tweets by identifying n-grams features which could be improved further by combining word embedding with sentiment features. Research by Kebriaei et al (2019) shows how a convolutional neural network (CNN) shows higher macro averaged F1-score than traditional ML approaches such as SVM, random forest (RF) and naive Bayes (NB). (Rajendran et al, 2019) uses an ensemble of classifiers to classify the offensive text in an imbalanced dataset by using models with Synthetic Minority Over-sampling technique (SMOTE).…”
Section: Related Workmentioning
confidence: 99%