WikiConv: A Corpus of the Complete Conversational History of a Large Online Collaborative Community

Hua, Yiqing; Danescu-Niculescu-Mizil, Cristian; Taraborelli, Dario; Thain, Nithum; Sorensen, Jeffery; Dixon, Lucas

doi:10.18653/v1/d18-1305

Cited by 29 publications

(30 citation statements)

References 12 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We found that on these examples the silver labels have 0.51 precision and 1.00 recall. This yields 0.67 F1 measure and is somewhat lower than the expected 0.85 obtained for this classifier in (Hua et al, 2018). The difference indicates that the thresholds from (Hua et al, 2018) obtained on non-deleted comments from Wikipedia may not perform equally well on deleted comments.…”

Section: Datasetmentioning

confidence: 61%

“…This yields 0.67 F1 measure and is somewhat lower than the expected 0.85 obtained for this classifier in (Hua et al, 2018). The difference indicates that the thresholds from (Hua et al, 2018) obtained on non-deleted comments from Wikipedia may not perform equally well on deleted comments. To address this and increase the quality of the labels, more deleted comments should be manually labeled and thresholds retuned using, e.g., the same error rate method of (Wulczyn et al, 2017a).…”

Section: Datasetmentioning

confidence: 61%

See 1 more Smart Citation

Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

Karan¹,

Šnajder²

2019

Proceedings of the Third Workshop on Abusive Language Online

View full text Add to dashboard Cite

We address the task of automatically detecting toxic content in user generated texts. We focus on exploring the potential for preemptive moderation, i.e., predicting whether a particular conversation thread will, in the future, incite a toxic comment. Moreover, we perform preliminary investigation of whether a model that jointly considers all comments in a conversation thread outperforms a model that considers only individual comments. Using an existing dataset of conversations among Wikipedia contributors as a starting point, we compile a new large-scale dataset for this task consisting of labeled comments and comments from their conversation threads.

show abstract

Section: Datasetmentioning

confidence: 61%

Section: Datasetmentioning

confidence: 61%

Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

Karan¹,

Šnajder²

2019

Proceedings of the Third Workshop on Abusive Language Online

View full text Add to dashboard Cite

show abstract

“…Zhang et al's 'Conversations Gone Awry' dataset consists of 1,270 conversations that took place between Wikipedia editors on publicly accessible talk pages. The conversations are sourced from the WikiConv dataset (Hua et al, 2018) and labeled by crowdworkers as either containing a personal attack from within (i.e., hostile behavior by one user in the conversation directed towards another) or remaining civil throughout.…”

Section: Derailment Datasetsmentioning

confidence: 99%

Trouble on the Horizon: Forecasting the Derailment of Online Conversations as they Develop

Chang¹,

Danescu-Niculescu-Mizil²

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

Self Cite

View full text Add to dashboard Cite

Online discussions often derail into toxic exchanges between participants. Recent efforts mostly focused on detecting antisocial behavior after the fact, by analyzing single comments in isolation. To provide more timely notice to human moderators, a system needs to preemptively detect that a conversation is heading towards derailment before it actually turns toxic. This means modeling derailment as an emerging property of a conversation rather than as an isolated utterance-level event.Forecasting emerging conversational properties, however, poses several inherent modeling challenges. First, since conversations are dynamic, a forecasting model needs to capture the flow of the discussion, rather than properties of individual comments. Second, real conversations have an unknown horizon: they can end or derail at any time; thus a practical forecasting model needs to assess the risk in an online fashion, as the conversation develops. In this work we introduce a conversational forecasting model that learns an unsupervised representation of conversational dynamics and exploits it to predict future derailment as the conversation develops. By applying this model to two new diverse datasets of online conversations with labels for antisocial events, we show that it outperforms state-of-the-art systems at forecasting derailment.

show abstract

“…In this work we use the complete conversational history between English Wikipedia editors on both article and user talk pages. With over 90 million conversations between 4 million users on 24 million talk pages, this is one of the largest collections of public conversations [20].…”

Section: Blocks On Wikipediamentioning

confidence: 99%

Trajectories of Blocked Community Members: Redemption, Recidivism and Departure

Chang

Danescu-Niculescu-Mizil

2019

The World Wide Web Conference

Self Cite

View full text Add to dashboard Cite

Community norm violations can impair constructive communication and collaboration online. As a defense mechanism, community moderators often address such transgressions by temporarily blocking the perpetrator. Such actions, however, come with the cost of potentially alienating community members. Given this tradeoff, it is essential to understand to what extent, and in which situations, this common moderation practice is effective in reinforcing community rules.In this work, we introduce a computational framework for studying the future behavior of blocked users on Wikipedia. After their block expires, they can take several distinct paths: they can reform and adhere to the rules, but they can also recidivate, or straight-out abandon the community. We reveal that these trajectories are tied to factors rooted both in the characteristics of the blocked individual and in whether they perceived the block to be fair and justified. Based on these insights, we formulate a series of prediction tasks aiming to determine which of these paths a user is likely to take after being blocked for their first offense, and demonstrate the feasibility of these new tasks. Overall, this work builds towards a more nuanced approach to moderation by highlighting the tradeoffs that are in play.

show abstract

WikiConv: A Corpus of the Complete Conversational History of a Large Online Collaborative Community

Cited by 29 publications

References 12 publications

Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

Trouble on the Horizon: Forecasting the Derailment of Online Conversations as they Develop

Trajectories of Blocked Community Members: Redemption, Recidivism and Departure

Contact Info

Product

Resources

About