2023
DOI: 10.1016/j.ipm.2023.103454
|View full text |Cite
|
Sign up to set email alerts
|

Cyberbullying detection for low-resource languages and dialects: Review of the state of the art

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 74 publications
(14 citation statements)
references
References 82 publications
1
13
0
Order By: Relevance
“…This process enables the segmentation of text into meaningful linguistic units, such as words or subwords, thereby facilitating further analysis. Our approach to tokenization aligns with previous research on the Chittagonian language, which also emphasized the importance of breaking down text into linguistic units [7].…”
Section: Tokenizationsupporting
confidence: 73%
See 4 more Smart Citations
“…This process enables the segmentation of text into meaningful linguistic units, such as words or subwords, thereby facilitating further analysis. Our approach to tokenization aligns with previous research on the Chittagonian language, which also emphasized the importance of breaking down text into linguistic units [7].…”
Section: Tokenizationsupporting
confidence: 73%
“…These characters encompass periods, commas, question marks, exclamation marks, hyphens, parentheses, quotation marks, and other nonalphanumeric symbols. While some punctuation may not significantly alter the meaning of a sentence, their presence can impact text classification tasks, including automatic cyberbullying detection, as demonstrated by Mahmud et al [7] in previous research. Their findings suggest that removing punctuation can enhance text classification accuracy, particularly when employing traditional machine learning algorithms.…”
Section: Removing Punctuationsmentioning
confidence: 87%
See 3 more Smart Citations