2017
DOI: 10.1371/journal.pone.0171518
|View full text |Cite
|
Sign up to set email alerts
|

An efficient incremental learning mechanism for tracking concept drift in spam filtering

Abstract: This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(14 citation statements)
references
References 26 publications
0
14
0
Order By: Relevance
“…There are four commonly used techniques for spam classification namely, a) Use of blacklist [14] b) Protocol-based approach c) Use of keywords or content filtering d) Header based [20], [28], [21], [5], [36], [13] In the first case, a list of email the network administrator maintains addresses or domain name databases. The classifier matches new record with blacklisted database and simply rejects some mails and puts them onto the spam folder.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…There are four commonly used techniques for spam classification namely, a) Use of blacklist [14] b) Protocol-based approach c) Use of keywords or content filtering d) Header based [20], [28], [21], [5], [36], [13] In the first case, a list of email the network administrator maintains addresses or domain name databases. The classifier matches new record with blacklisted database and simply rejects some mails and puts them onto the spam folder.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Spam classification helps us to filter the unwanted emails from the email Inbox. There have been various attempts to classify the spam email based on using email header [20], [21], [5], [36], [37], [38], [13], [4], using email body [3], [41], [35], [29], [27], [30], [7], [31], [32], [33], [34] and also using both body and header [18], [23], [21], [15], [42] and statistical features [19], [25]. The email header classification is performed using techniques such as Naïve Bayes (NB), Decision Tree (DT) [40] [43], and Support Vector Machine (SVM) [23], [24], [20], [13], [26] Random Forest (RF) [4], [13].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Recently, the batch algorithms are being continuously improved. Some of them try to choose different base-classifiers, such as Decision Tree, Fuzzy Rule, K-nearest neighbor and so on [6][7][8]. Some of them try to choose different window error thresholds to improve the classification accuracy [9].…”
Section: Relevant Algorithmsmentioning
confidence: 99%
“…Denkowski et al (2014) describe a framework for building adaptive MT systems that learn from post-editor feedback, and 3) incremental learning for spam filtering, e.g. Sheu et al (2017) use a window-based technique to estimate for the condition of concept drift for each incoming new email.…”
Section: Introductionmentioning
confidence: 99%