2021
DOI: 10.48550/arxiv.2109.09483
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI

Amanda Cercas Curry,
Gavin Abercrombie,
Verena Rieser

Abstract: We present the first English corpus study on abusive language towards three conversational AI systems gathered 'in the wild': an opendomain social bot, a rule-based chatbot, and a task-based system. To account for the complexity of the task, we take a more 'nuanced' approach where our ConvAI dataset reflects fine-grained notions of abuse, as well as views from multiple expert annotators. We find that the distribution of abuse is vastly different compared to other commonly used datasets, with more sexually tint… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 35 publications
0
1
0
Order By: Relevance
“…The designed dataset is annotated and consist of varied class labels at different granularity levels. A English text-based hate speech dataset is developed by considering three conversational AI systems, namely, open domain-based social bot, rule-based chatbot and task-based conversational AI system in (Curry et al, 2021).…”
Section: Hate Speech Detection Datasetsmentioning
confidence: 99%
“…The designed dataset is annotated and consist of varied class labels at different granularity levels. A English text-based hate speech dataset is developed by considering three conversational AI systems, namely, open domain-based social bot, rule-based chatbot and task-based conversational AI system in (Curry et al, 2021).…”
Section: Hate Speech Detection Datasetsmentioning
confidence: 99%