2003 Symposium on Applications and the Internet, 2003. Proceedings.
DOI: 10.1109/saint.2003.1183045
|View full text |Cite
|
Sign up to set email alerts
|

Identifying junk electronic mail in Microsoft outlook with a support vector machine

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(16 citation statements)
references
References 3 publications
0
16
0
Order By: Relevance
“…The BDT learning time is too long. Woitaszek and colleagues [9] of a SVM simple and a personal dictionary to determine the e-mails are commercially used.…”
Section: Related Workmentioning
confidence: 99%
“…The BDT learning time is too long. Woitaszek and colleagues [9] of a SVM simple and a personal dictionary to determine the e-mails are commercially used.…”
Section: Related Workmentioning
confidence: 99%
“…The number of selected attributes is 10, 10, and 9, respectively. Eight attributes {7, 16,21,23,25,46,52, and 53} are common to all three input subsets. This indicates the effectiveness of such attributes as spam predictors and the robustness of the modeling process.…”
Section: Aim Abductive Network Modelingmentioning
confidence: 99%
“…Table 6 shows the structure of the resulting models for the 3 ensemble members together with the classification accuracy for each individual model on both the training set (948 cases) and evaluation set (1757 cases). The models select between 9 and 11 attributes as inputs, with 7 attributes {7, 16,25,27,46, 52, and 53} being common to all three models. Out of this subset of 7 attributes, six attributes {7, 16, 25, 46, 52, and 53} were also common to the models in Table 3.…”
Section: Improving Classification Accuracy Using Abductive Network Enmentioning
confidence: 99%
See 1 more Smart Citation
“…According the description given in Woitaszek et al [11], an e-mail may be represented by a feature vector x that is composed of the various words from a dictionary formed by analyzing the collected e-mails. Thus, an e-mail is classified as spam or non-spam by performing a simple dot product between the features of an e-mail and the SVM model weight vector,…”
Section: Support Vector Machinementioning
confidence: 99%