Experts and Machines against Bullies: A Hybrid Approach to Detect Cyberbullies

Dadvar, Maral; Trieschnigg, Dolf; Jong, Franciska de

doi:10.1007/978-3-319-06483-3_25

Cited by 111 publications

(85 citation statements)

References 12 publications

(11 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…A barrier for the use of text mining techniques for abusive content detection is the lack of labelled datasets in the field. At present, researchers collect data, and annotate by one of two approaches -their own labelling effort [30,9,1,19] which is timeconsuming or through the use of crowdsourcing services [25,2] such as Amazon's Mechanical Turk which can be costly.…”

Section: Related Workmentioning

confidence: 99%

“…Other research takes extra information into account to enhance classification accuracy such as user profile features [9,8] including age and gender; semantic features of the user comment [30,9,1,32] such as parts of speech, number of pronouns; and features such as profanity word occurrences [1,32,4]. Text context features have also been analysed in recent years.…”

Section: Related Workmentioning

confidence: 99%

“…Wellknown classifiers have been used in this domain including Support Vector Machines (SVM) [9,16], Näive Bayes [4,13], logistic regression [29], and decision trees [25,16]. Classifier ensemble solutions have also proven successful.…”

Section: Related Workmentioning

confidence: 99%

“…Dadvar et al [10] detected cyberbullies instead of detecting text content; Nahar et al [21] used social networks to present a graph model, identifying the most active cyberbullying predators and victims; Xu et al [31] explored the detection of roles within cyberbullying, and identified those of bully, victim, accuser and reporter. In addition, research [30,13,29] used Latent Dirichlet Allocation (LDA) to extract the main topics for each text content.…”

Section: Related Workmentioning

confidence: 99%

“…D2-YouTube Dadvar et al [9] created a corpus of comments from the video upload site, YouTube, by scraping comments from sensitive cyberbullying topics within the site. Each labelled instance consists of a single user's comment.…”

Section: Datasetsmentioning

confidence: 99%

See 4 more Smart Citations

Harnessing the Power of Text Mining for the Detection of Abusive Content in Social Media

Chen¹,

McKeever²,

Delany³

2016

Advances in Intelligent Systems and Computing

View full text Add to dashboard Cite

The issues of cyberbullying and online harassment have gained considerable coverage in the last number of years. Social media providers need to be able to detect abusive content both accurately and efficiently in order to protect their users. Our aim is to investigate the application of core text mining techniques for the automatic detection of abusive content across a range of social media sources include blogs, forums, media-sharing, Q&A and chat -using datasets from Twitter, YouTube, MySpace, Kongregate, Formspring and Slashdot. Using supervised machine learning, we compare alternative text representations and dimension reduction approaches, including feature selection and feature enhancement, demonstrating the impact of these techniques on detection accuracies. In addition, we investigate the need for sampling on imbalanced datasets. Our conclusions are: (1) Dataset balancing boosts accuracies significantly for social media abusive content detection; (2) Feature reduction, important for large feature sets that are typical of social media datasets, improves efficiency whilst maintaining detection accuracies; (3) The use of generic structural features common across all our datasets proved to be of limited use in the automatic detection of abusive content. Our findings can support practitioners in selecting appropriate text mining strategies in this area.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Datasetsmentioning

confidence: 99%

See 3 more Smart Citations

Harnessing the Power of Text Mining for the Detection of Abusive Content in Social Media

Chen¹,

McKeever²,

Delany³

2016

Advances in Intelligent Systems and Computing

View full text Add to dashboard Cite

show abstract

Identification of cyberbullying on multi‐modal social media posts using genetic algorithm

Kumari

Singh

2020

Trans Emerging Tel Tech

View full text Add to dashboard Cite

Cyberbullying is one of the detrimental effects, social media is facing nowadays. With the increasing use of photo sharing and text comments, the severity of cyberbullying has increased many folds. Automated tools to detect these events have become necessary to make this platform healthy and secure. Sometimes innocent‐looking images and text also convey bullying messages when posted together. So, the separate systems for processing text and images may not work properly to identify all cases of cyberbullying. In this research, we have tried to extract combined features of text and images to identify different cases of cyberbullying. We used a pre‐trained VGG‐16 network and convolutional neural network to extract the features from images and text, respectively. These features are further optimized using genetic algorithm to increase the efficiency of the whole system. Our proposed model is validated with a dataset containing text and image to achieve an F1‐score 78% which shows an improvement of 9% over earlier reported results on the same dataset.

show abstract

A Comparison of Classical Versus Deep Learning Techniques for Abusive Content Detection on Social Media Sites

Chen¹,

McKeever²,

Delany³

2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Experts and Machines against Bullies: A Hybrid Approach to Detect Cyberbullies

Cited by 111 publications

References 12 publications

Harnessing the Power of Text Mining for the Detection of Abusive Content in Social Media

Harnessing the Power of Text Mining for the Detection of Abusive Content in Social Media

Identification of cyberbullying on multi‐modal social media posts using genetic algorithm

A Comparison of Classical Versus Deep Learning Techniques for Abusive Content Detection on Social Media Sites

Contact Info

Product

Resources

About