2019 IEEE International Conference on Electro Information Technology (EIT) 2019
DOI: 10.1109/eit.2019.8833846
|View full text |Cite
|
Sign up to set email alerts
|

Multilingual Cyberbullying Detection System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
20
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 50 publications
(21 citation statements)
references
References 18 publications
(27 reference statements)
1
20
0
Order By: Relevance
“…The authors observed that tweets with similar content written in different languages hinder the classifiers' performances. Pawar and Raje [36] modeled linguistic patterns upon a hand-labeled bilingual (Hindi and Marathi languages) dataset using Machine Learning and Natural Language Processing techniques to detect cyberbullying in Twitter and Internet forums. Moreover, Steimel et al [37] experimented with a general cyberbullying detection model across multiple languages (English and German) with data collected from Twitter.…”
Section: Related Workmentioning
confidence: 99%
“…The authors observed that tweets with similar content written in different languages hinder the classifiers' performances. Pawar and Raje [36] modeled linguistic patterns upon a hand-labeled bilingual (Hindi and Marathi languages) dataset using Machine Learning and Natural Language Processing techniques to detect cyberbullying in Twitter and Internet forums. Moreover, Steimel et al [37] experimented with a general cyberbullying detection model across multiple languages (English and German) with data collected from Twitter.…”
Section: Related Workmentioning
confidence: 99%
“…The datasets are usually collected by crawling the target social media using its Application Programming Interface (API). There are also prepared datasets on Kaggle website [44].…”
Section: Datasetsmentioning
confidence: 99%
“…However, there are other datasets which are used in practice, but they are not popular among researchers. Pawar et al [44] used datasets from multiple sources as they realized gathering dataset for Hindi and Marathi language would be a challenge. They obtained dataset from tour reviews, movie reviews and newspaper reviews for Hindi language.…”
Section: Formspringmentioning
confidence: 99%
“…Through a revision of the literature, works were studied that propose some aggression detection mechanism. As has already been mentioned, most of these works were applied to texts in English [8][9][10][11][12][13][14][15][16][17]. Regarding the approach used, more than 50% of the articles used ML techniques, using different corpora to train classifiers.…”
Section: Introductionmentioning
confidence: 99%