2021
DOI: 10.17993/3ctic.2021.102.101-121
|View full text |Cite
|
Sign up to set email alerts
|

Development of computational linguistic resources for automated detection of textual cyberbullying threats in Roman Urdu language

Abstract: Automatic Cyberbullying detection has remained very challenging task since social media content and conversations are usually posted in unstructured free-text form leaving behind the language norms. The major concern and gap in formulating cyberbullying detection strategies is scarcity of available linguistic resources typically for newly evolved languages. Roman Urdu has recently emerged and hence is a resource poor language. Urdu has been widely known as the national language of Pakistan. However, because of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 22 publications
(14 citation statements)
references
References 24 publications
1
13
0
Order By: Relevance
“…Cyberbullying comments in our sample contained more function words, prepositions, tense markers, and numbers that could easily be filtered as stop words, implying that stop-word selection can be explored further in the future. This finding is consistent with that of Dewani et al (2021), who noted that stop words, such as "here" and "today" in some time-sensitive and locationsensitive key messages, could affect NLP results. More subtle cyberbullying may be achieved linguistically through the use of these kinds of words as modifications or similes (Tan, 2019).…”
Section: General Linguistic Features Of Cyberbullyingsupporting
confidence: 91%
“…Cyberbullying comments in our sample contained more function words, prepositions, tense markers, and numbers that could easily be filtered as stop words, implying that stop-word selection can be explored further in the future. This finding is consistent with that of Dewani et al (2021), who noted that stop words, such as "here" and "today" in some time-sensitive and locationsensitive key messages, could affect NLP results. More subtle cyberbullying may be achieved linguistically through the use of these kinds of words as modifications or similes (Tan, 2019).…”
Section: General Linguistic Features Of Cyberbullyingsupporting
confidence: 91%
“…Kappa coefficient is an index for conformance testing [36][37]. Where NT is the number of environments, N is the total index, gi is the classification result, and pi is the classification result predicted by the model [38][39].…”
Section: Table 1 Confusion Matrix Of Training Data Setmentioning
confidence: 99%
“…It uses much fewer limits than the MLP network. It makes it possible to build a hidden 3D CNN model than an MLP network [33][34]. The hidden 3D CNN typically can better learn the high-order correlation between embedded dimensions, thus bringing higher accuracy for the recovery of missing data.…”
Section: Detail Of the Ntc Modelmentioning
confidence: 99%