2011
DOI: 10.5120/1974-2646
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning methods for E-mail Classification

Abstract: The increasing volume of unsolicited bulk e-mail (also known as spam) has generated a need for reliable antispam filters. Using a classifier based on machine learning techniques to automatically filter out spam email has drawn many researchers attention. In this paper we review some of the most popular machine learning methods (Bayesian classification, k-NN, ANNs, SVMs, Artificial immune system and Rough sets) and of their applicability to the problem of spam Email classification. Descriptions of the algorithm… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 38 publications
(8 citation statements)
references
References 11 publications
0
8
0
Order By: Relevance
“…The report by Awad et al [3] is more of an educational paper than a research paper. The author reviews six most popular classification methods (Bayesian classification, ANNs, SVMs, k-NN, Rough sets, and Artificial immune system) to perform a spam email classification task.…”
Section: Related Workmentioning
confidence: 99%
“…The report by Awad et al [3] is more of an educational paper than a research paper. The author reviews six most popular classification methods (Bayesian classification, ANNs, SVMs, k-NN, Rough sets, and Artificial immune system) to perform a spam email classification task.…”
Section: Related Workmentioning
confidence: 99%
“…The attribute which has the greatest information gain will be chosen. To calculate two types of entropy and information gain, formulas are defined in Eqs ( 6) and (7).…”
Section: Classification Techniquesmentioning
confidence: 99%
“…Overfitting occurs when the complex model is made for a simple dataset. Mostly spam email affect the user in the form of time consumption while reading spam email, bandwidth and in form of space that is required for the storage of spam email [7]. Users spend a lot of time reading spam emails which are useless for them.…”
Section: Introductionmentioning
confidence: 99%
“…Naive Bayes algorithm is one of the machine learning methods that is used in text classification. It is a statistical inference based on probability, and is used to determine previously created classes [9]. NB uses a discriminant function to compute the conditional probabilities of P(Ci|X).…”
Section: Naive Bayes (Nb)mentioning
confidence: 99%