2023
DOI: 10.1016/j.csl.2022.101464
|View full text |Cite
|
Sign up to set email alerts
|

HOTTEST: Hate and Offensive content identification in Tamil using Transformers and Enhanced STemming

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…A feature shared by the above studies is the focus on addressing cyberbullying challenges associated with high-resource languages. There is an increasing interest in the detection of offensive content and hate speech in low-resource languages such as Tamil [34,35], Pashto [36], Urdu [37], Persian [38]. Similar studies have been focused on improving resources for tackling offensive and hateful content detection [39,40,41,42].…”
Section: Offensive Content Detectionmentioning
confidence: 99%
“…A feature shared by the above studies is the focus on addressing cyberbullying challenges associated with high-resource languages. There is an increasing interest in the detection of offensive content and hate speech in low-resource languages such as Tamil [34,35], Pashto [36], Urdu [37], Persian [38]. Similar studies have been focused on improving resources for tackling offensive and hateful content detection [39,40,41,42].…”
Section: Offensive Content Detectionmentioning
confidence: 99%
“…The results showed that Stacking-LR provided the highest ACC of the three cases. In a recent study [29], the authors combine linguistic-based analysis with ensemble learning techniques to analyze Tamil text found in YouTube comments. The text representations were generated by employing a combination of data stemming and the utilization of the MuRIL pre-trained transformer developed by Google.…”
Section: B Ensemble Technique In Sentimental Analysismentioning
confidence: 99%
“…The RoBERTa model has exhibited state-of-the-art performance for crosslingual and multi-lingual NLP tasks [12]. Likewise, the MuRIL model demonstrated benchmark performance in mono-lingual and multi-lingual classification tasks for Indian languages [13].…”
Section: Introductionmentioning
confidence: 99%