2020
DOI: 10.1088/1757-899x/835/1/012036
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Hoaxes in Indonesian News Using TF/TDM and K Nearest Neighbor

Abstract: The presence of the internet and the rapid growth of social media had given rise to the blossoming of hoax creation and distribution through it. A hoax can cause anxiety and reactivity to its readers and could harm a certain party. Thereby, it is important to detect and report hoaxes to stop its spreading as soon as possible. This research aims to utilize the K Nearest Neighbor (KNN) classification algorithm to detect whether a piece of news is a hoax or not. Experiments were done by using 74 hoaxes compiled f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 11 publications
0
5
0
Order By: Relevance
“…The purpose of the stop word removal process is to speed up process performance because it reduces some common words, as well as to improve model performance because the data to be modeled is data that specifically contains important words that represent a certain class [4]- [6], [11]. For the stop word removal process carried out in this study, the researcher used the Indonesian language stop word list from Talla F.Z [17].…”
Section: Figure 2 Pre-processing Stepsmentioning
confidence: 99%
See 1 more Smart Citation
“…The purpose of the stop word removal process is to speed up process performance because it reduces some common words, as well as to improve model performance because the data to be modeled is data that specifically contains important words that represent a certain class [4]- [6], [11]. For the stop word removal process carried out in this study, the researcher used the Indonesian language stop word list from Talla F.Z [17].…”
Section: Figure 2 Pre-processing Stepsmentioning
confidence: 99%
“…As a result, information is now easier, faster, and more flexible to be accessed anywhere and anytime regardless of the location and place where the information is located. These developments certainly cannot be separated from the role of the internet which is currently evolving continuously and has even transformed into a major need for humans [4]- [6].…”
Section: Introductionmentioning
confidence: 99%
“…We are using Term Frequency-Inverse Document Frequency (TF-IDF) to give weights for every word to create a vector space for the text. TF-IDF is commonly used in text classification tasks such as in [1]. This research is essentially a multiclass classification problem.…”
Section: -03mentioning
confidence: 99%
“…This classification problem can be solved automatically using Deep Learning models. There have been many studies on the classification of hoaxes in Indonesian, from those using word similarity measurement theories such as Text Rank and Dice Similarity [4], Extreme Gradient Boosting [5], classic machine learning methods such as Naive Bayes [6]- [9], K-Nearest Neighbor [10], Random Forest [7], Decision Tree [7], and Support Vector Machine [6], [7], [11], to deep learning methods such as Convolution Neural Network (CNN) [6], [12], Recurrent Neural Network (RNN) [13], Long Short-Term Memory (LSTM) [6], [14], and Gated Recurrent Unit (GRU) [6], [12], [14]. The deep learning methods that have been widely used to classify hoaxes in Indonesian language each have advantages and disadvantages.…”
Section: Introductionmentioning
confidence: 99%