Using Decision Tree Algorithms in Detecting Spam Emails Written in Malay: A Comparison Study

Abdulrahman, Saifuldeen H; Salim, Mohammad A.

doi:10.1051/itmconf/20224201001

Cited by 1 publication

(1 citation statement)

References 14 publications


“…These staggering numbers show how potentially dangerous it is to use this seemingly simple but effective communication tool in Figure 2 So far, there are many spam filtering techniques. Decision tree is one of them, [7] a method of supervised learning in which the main idea is top-down divide and conquer. First recurse upwards from the root position, find an attribute that can be divided at the intermediate node, all subsets continue to be recursively divided according to its internal nodes, if these subsets can be correctly classified, then the leaf nodes can be constructed, these subsets also need Classification to the corresponding leaf node, when each subset is classified to the leaf node, the decision tree is formed.…”

Section: Introduction (mentioning)

confidence: 99%
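
The top-down divide-and-conquer idea described in the citation statement above can be sketched in a few lines of Python. This is a minimal illustration only, not the method of either paper: the information-gain split criterion, the boolean word-presence features, and the toy messages (`toy_rows`, `toy_labels`) are all assumptions made for the example.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def build_tree(rows, labels, features):
    """Recursively split on the feature with the highest information gain.

    rows     -- list of dicts mapping feature name -> bool (is the word present?)
    labels   -- class labels, parallel to rows
    features -- feature names still available for splitting
    """
    majority = Counter(labels).most_common(1)[0][0]
    # Leaf: the subset is already pure, or nothing is left to split on.
    if len(set(labels)) == 1 or not features:
        return majority

    def gain(feature):
        remainder = 0.0
        for value in (True, False):
            subset = [lab for row, lab in zip(rows, labels) if row[feature] == value]
            if subset:
                remainder += len(subset) / len(labels) * entropy(subset)
        return entropy(labels) - remainder

    best = max(features, key=gain)
    if gain(best) <= 1e-12:  # no feature separates the classes any further
        return majority

    remaining = [f for f in features if f != best]
    children = {}
    for value in (True, False):
        keep = [i for i, row in enumerate(rows) if row[best] == value]
        children[value] = (
            build_tree([rows[i] for i in keep], [labels[i] for i in keep], remaining)
            if keep else majority
        )
    return {"feature": best, "children": children}


def classify(tree, row):
    """Walk from the root to a leaf and return the leaf's label."""
    while isinstance(tree, dict):
        tree = tree["children"][row[tree["feature"]]]
    return tree


# Hypothetical toy data: does the message contain "free" / "meeting"?
toy_rows = [
    {"free": True, "meeting": False},
    {"free": True, "meeting": False},
    {"free": False, "meeting": True},
    {"free": False, "meeting": False},
    {"free": True, "meeting": True},
]
toy_labels = ["spam", "spam", "ham", "ham", "ham"]
tree = build_tree(toy_rows, toy_labels, ["free", "meeting"])
```

Calling `classify(tree, {"free": True, "meeting": False})` walks the internal nodes down to a leaf and returns that leaf's label. Production decision-tree learners add pruning and support for numeric attributes, which this sketch leaves out.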

Naive Bayesian Spam Filtering

Zhu

2023

HSET


The spam filtering system is used to identify which emails in the received emails are completely meaningless to the recipient and perform operations such as interception and deletion. Nowadays, with the rapid development of the Internet, while e-mail provides convenience for people, spam also comes along with it, which brings many troubles to users. According to statistics, 80% of the emails in the world are spam, and e-spam is really annoying. Therefore, how to solve the problem of filtering emails has important practical significance. Spam filtering using Bayesian theory is a statistical technique applied to email filtering. It essentially uses Bayesian classification to discriminate the attributes of emails, including spam and non-spam. Bayesian-based spam filtering is a very effective technique that can modify the model to meet the needs of specific users and give a lower spam detection rate that is acceptable to users. In this experiment, we use Naive Bayes for experiments, and we use Unigram and bigram methods to preprocess the data, respectively. Finally, it is concluded that the data processing accuracy of unigram and bigram is greater than 0.75, and bigram performs better in four different evaluation indicators.
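
To make the unigram versus bigram comparison in the abstract concrete, here is a small multinomial Naive Bayes classifier written from scratch. It is a sketch under stated assumptions, not the paper's implementation: the whitespace tokenizer, the Laplace smoothing constant `alpha`, the `NaiveBayes` class name, and the four toy messages are all hypothetical.

```python
from collections import Counter, defaultdict
from math import log


def ngrams(text, n):
    """Lowercase, split on whitespace, and return the list of word n-grams."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


class NaiveBayes:
    """Multinomial Naive Bayes over word n-grams with Laplace smoothing."""

    def __init__(self, n=1, alpha=1.0):
        self.n = n          # 1 = unigram features, 2 = bigram features
        self.alpha = alpha  # smoothing constant (assumed value)

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.feature_counts[label].update(ngrams(text, self.n))
        self.vocab = {f for counts in self.feature_counts.values() for f in counts}
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, doc_count in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            # log P(class) + sum of log P(feature | class)
            score = log(doc_count / total_docs)
            for feat in ngrams(text, self.n):
                if feat in self.vocab:  # ignore n-grams never seen in training
                    score += log((counts[feat] + self.alpha) / denom)
            scores[label] = score
        return max(scores, key=scores.get)


# Hypothetical toy corpus, for illustration only.
toy_texts = [
    "win free money now",
    "free money offer",
    "meeting at noon tomorrow",
    "lunch meeting tomorrow",
]
toy_labels = ["spam", "spam", "ham", "ham"]

unigram_model = NaiveBayes(n=1).fit(toy_texts, toy_labels)
bigram_model = NaiveBayes(n=2).fit(toy_texts, toy_labels)
```

Both models share the same scoring rule and differ only in the features they count, which is the distinction the abstract draws between its two preprocessing methods. A corpus this small says nothing about which variant is more accurate.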
