2021
DOI: 10.1007/978-3-030-72376-7_2
|View full text |Cite
|
Sign up to set email alerts
|

Norm Violation in Online Communities – A Study of Stack Overflow Comments

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 13 publications
0
4
0
Order By: Relevance
“…The sizes of the datasets after sampling are shown in Table 1. To identify the offensive comments, we obtained the Perspective API (PAPI) 7 score and Regular Expression (Regex) status of the comments since these measures have been used by the SO Heat Detection bot 8 to detect toxicity in SO comments. PAPI offers a 'toxicity' range for a text in the interval of 0 and 1, where 0 represents least toxic and 1 represents extreme toxicity.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The sizes of the datasets after sampling are shown in Table 1. To identify the offensive comments, we obtained the Perspective API (PAPI) 7 score and Regular Expression (Regex) status of the comments since these measures have been used by the SO Heat Detection bot 8 to detect toxicity in SO comments. PAPI offers a 'toxicity' range for a text in the interval of 0 and 1, where 0 represents least toxic and 1 represents extreme toxicity.…”
Section: Methodsmentioning
confidence: 99%
“…The nature of gender hostility in SO has been analysed by Brooke [3]. Cheriyan et al [7] report norm violations in SO and provide manual analysis of comments to show that offence and unfriendliness also exist in SO [7]. Stress owing to toxicity in open source communities has been investigated by Raman et al [30].…”
Section: Related Work 21 Offensive Language Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Such differences make chats in the synchronous domain more difficult to be moderated by existing approaches. Founta et al, 2018;Basile et al, 2019;ElSherief et al, 2021), Reddit (Datta and Adar, 2019;Kumar et al, 2018;Park et al, 2021), Stackoverflow (Cheriyan et al, 2017) and Github (Miller et al, 2022), efforts that extend them to live streaming platforms have been absent. In this paper, we study unique characteristics of comments in livestreaming services and develop new datasets and models for appropriately using contextual information to automatically moderate toxic content and norm violations.…”
Section: Introductionmentioning
confidence: 99%