2008
DOI: 10.1109/tkde.2008.76
|View full text |Cite
|
Sign up to set email alerts
|

Text Document Preprocessing with the Bayes Formula for Classification Using the Support Vector Machine

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
122
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 193 publications
(134 citation statements)
references
References 8 publications
1
122
0
Order By: Relevance
“…In this stage the dataset get normalized and prepared for the classification algorithm so that the particular algorithm can run smoothly and bring effective results in minimum time [8]. According to many researches, parameters for pre-processing includes TF-IDF, Stemmer, stopwords Handler and tokenizer etc [1], [25], [30]. In this study we have used the default parameters for preprocessing as shown in Figure 2.…”
Section: Pre-processingmentioning
confidence: 99%
“…In this stage the dataset get normalized and prepared for the classification algorithm so that the particular algorithm can run smoothly and bring effective results in minimum time [8]. According to many researches, parameters for pre-processing includes TF-IDF, Stemmer, stopwords Handler and tokenizer etc [1], [25], [30]. In this study we have used the default parameters for preprocessing as shown in Figure 2.…”
Section: Pre-processingmentioning
confidence: 99%
“…Isa et al, in [24] used the Bayes formula to represent the document as a set of vectors according to a probability division revealing the categories that the document possibly will belong to. This probability distribution as the vectors to represent the document, the SVM is used to classify the documents.…”
Section: Backgroungmentioning
confidence: 99%
“…For assigning web page to category following formula is calculated for each document for every category [17]:…”
Section: Naïve Bayesian Classifiermentioning
confidence: 99%