SoK: Hate, Harassment, and the Changing Landscape of Online Abuse

Thomas, Kurt; Akhawe, Devdatta; Bailey, Michael; Boneh, Dan; Bursztein, Elie; Consolvo, Sunny; Dell, Nicola; Durumeric, Zakir; Kelley, Patrick Gage; Kumar, Deepak; McCoy, Damon; Meiklejohn, Sarah; Ristenpart, Thomas; Stringhini, Gianluca

doi:10.1109/sp40001.2021.00028

Cited by 82 publications

(31 citation statements)

References 121 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We use the term toxic content as an umbrella for identitybased attacks such as anti-Semitism or racism posted publicly to social media [19,37,55], bullying in online gaming or replies to posts [35,50], trolling [8], threats of violence, sexual harassment, and more [47,52]. These attacks represent just a subset of abuse stemming from hate and harassment, a much broader threat that encompasses any activity where an attacker attempts to inflict emotional harm on a target (e.g., stalking, doxxing, sextortion, and intimate partner violence) [9,52]. Unlike spam, phishing, or related abuse classification problems that can rely on expert raters, toxic content is an inherently subjective problem as we show in our work.…”

Section: What Is Toxic Content?mentioning

confidence: 99%

“…Online hate and harassment is a pernicious threat facing 48% of Internet users [52]. In response to this growing challenge, online platforms have developed automated tools to take action against toxic content (e.g., hate speech, threats, identity attacks).…”

Section: Introductionmentioning

confidence: 99%

“…These "gray areas" stem from the fact that users may disagree about what constitutes toxic content online based on their lived experiences, cultural perspective, political views towards free speech, or access to appropriate context [24,50]. While prior research has demonstrated that certain groups are more at-risk of experiencing online hate and harassment [45,52], no study has investigated how users from diverse backgrounds interpret online toxicity or how their views on what content they would like to see online differ. Understanding these nuanced differences is an important first step to designing harassment defenses for diverse Internet users.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Designing Toxic Content Classification for a Diversity of Perspectives

Kumar¹,

Kelley²,

Consolvo³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

In this work, we demonstrate how existing classifiers for identifying toxic comments online fail to generalize to the diverse concerns of Internet users. We survey 17,280 participants to understand how user expectations for what constitutes toxic content differ across demographics, beliefs, and personal experiences. We find that groups historically at-risk of harassment-such as people who identify as LGBTQ+ or young adults-are more likely to to flag a random comment drawn from Reddit, Twitter, or 4chan as toxic, as are people who have personally experienced harassment in the past. Based on our findings, we show how current one-size-fits-all toxicity classification algorithms, like the Perspective API from Jigsaw, can improve in accuracy by 86% on average through personalized model tuning. Ultimately, we highlight current pitfalls and new design directions that can improve the equity and efficacy of toxic content classifiers for all users.

show abstract

Section: What Is Toxic Content?mentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Designing Toxic Content Classification for a Diversity of Perspectives

Kumar¹,

Kelley²,

Consolvo³

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Technology-enabled IPV is a troubling phenomenon that fits into the broader ecosystem of online hate, harassment, and abuse [49]. Its manifestations include many forms of harassment [6,23,39], character assassination through faked revenge porn [43], impersonation attacks that damage the targeted individual's relationships [25,40], and above all, spying on an intimate partner through stalkerware and other means, such as knowledge of the survivor's account credentials [24,25].…”

Section: Background and Related Workmentioning

confidence: 99%

“…In the U.S., 15.8% of women and 5.3% of men reported being subjected to stalking violence "in which they felt very fearful or believed that they or someone close to them would be harmed or killed" [8]. IPV survivors have shed light on the many ways in which technology plays a role in inter-personal attacks [8,20,40,43,49], of which tech-enabled stalking and spying by current or former romantic partners are especially common and pernicious [24,34]. In a recent survey, 10% of the U.S. adult respondents admitted to using a mobile phone app to spy on an intimate partner [50].…”

Section: Introductionmentioning

confidence: 99%

Towards Stalkerware Detection with Precise Warnings

Han

Roundy

Tamersoy

2021

Annual Computer Security Applications Conference

View full text Add to dashboard Cite

Stalkerware enables individuals to conduct covert surveillance on a targeted person's device. Android devices are a particularly fertile ground for stalkerware, most of which spy on a single communication channel, sensor, or category of private data, though 27% of stalkerware surveil multiple of private data sources. We present Dosmelt, a system that enables stalkerware warnings that precisely characterize the types of surveillance conducted by Android stalkerware so that surveiled individuals can take appropriate mitigating action. We use an active-and semi-supervised learning to make headway on this task, which is vital because we are the first to characterize stalkerware according to its individual surveillance capabilities at a significant scale, which requires time-consuming expert-labeling of stalkerware apps. Dosmelt leverages the observation that stalkerware differs from other categories of spyware in its open advertising of its surveillance capabilities, which we detect on the basis of the titles and self-descriptions of stalkerware apps that are posted on Android app stores. Dosmelt achieves up to 96% AUC for stalkerware detection with a 91% Macro-F1 score of surveillance capability attribution for stalkerware apps. Dosmelt has detected hundreds of new stalkerware apps that we have added to the Stalkerware Threat List.

show abstract

Nine Challenges for Immersive Entertainment

Lages

2023

Communications in Computer and Information Science

View full text Add to dashboard Cite

SoK: Hate, Harassment, and the Changing Landscape of Online Abuse

Cited by 82 publications

References 121 publications

Designing Toxic Content Classification for a Diversity of Perspectives

Designing Toxic Content Classification for a Diversity of Perspectives

Towards Stalkerware Detection with Precise Warnings

Nine Challenges for Immersive Entertainment

Contact Info

Product

Resources

About