2022
DOI: 10.30865/mib.v6i2.3855
|View full text |Cite
|
Sign up to set email alerts
|

Feature Expansion Using Word2vec for Hate Speech Detection on Indonesian Twitter with Classification Using SVM and Random Forest

Abstract: Hate speech is one of the most common cases on Twitter. It is limited to 280 characters in uploading tweets, resulting in many word variations and possible vocabulary mismatches. Therefore, this study aims to overcome these problems and build a hate speech detection system on Indonesian Twitter. This study uses 20,571 tweet data and implements the Feature Expansion method using Word2vec to overcome vocabulary mismatches. Other methods applied are Bag of Word (BOW) and Term Frequency-Inverse Document Frequency … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…We carefully reviewed each document to obtain the key information of each work. In this part, we focus on [11], [30], [17], [23], [12], [28], [27], [21], [31], [32], [33], [34], [35], [36], [37], [38], [39], [40]…”
Section: B What Has Been Done So Far In Indonesian Abusive Language D...mentioning
confidence: 99%
“…We carefully reviewed each document to obtain the key information of each work. In this part, we focus on [11], [30], [17], [23], [12], [28], [27], [21], [31], [32], [33], [34], [35], [36], [37], [38], [39], [40]…”
Section: B What Has Been Done So Far In Indonesian Abusive Language D...mentioning
confidence: 99%
“…Shofianina Dwi Ananda Putri et al used Naive Bayes, SVM and other machine learning approach to detect hate speech and offensive language from Sundanese and Javanese Indonesian local language [9]. Mila Putri Kartika Dewi et al implemented a feature expansion system from comments using Word2Vec, Bag of Word (BOW) and TF-IDF [10]. Nurtheri Cahyana et al proposed an automatic annotation system using K-Nearest Neighbor to annotate comments datasets easily [11].…”
Section: Literature Studymentioning
confidence: 99%