2011
DOI: 10.1007/978-3-642-20573-6_55
|View full text |Cite
|
Sign up to set email alerts
|

Naive Bayes Approach for Website Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
3

Relationship

2
6

Authors

Journals

citations
Cited by 30 publications
(16 citation statements)
references
References 3 publications
0
16
0
Order By: Relevance
“…In this study, a Quick Reduct algorithm was used for dimensionality reduction and information gain was used for feature selection. The study concluded that this approach would improve the accuracy and efficiency of the classifier Rajalakshmi & Aravindan [14] performed a study that used only the URL of web page as the feature. This has a great advantage because the contents of a web page need not be fetched.…”
Section: B Feature Selection and Feature Extraction Techniquesmentioning
confidence: 99%
“…In this study, a Quick Reduct algorithm was used for dimensionality reduction and information gain was used for feature selection. The study concluded that this approach would improve the accuracy and efficiency of the classifier Rajalakshmi & Aravindan [14] performed a study that used only the URL of web page as the feature. This has a great advantage because the contents of a web page need not be fetched.…”
Section: B Feature Selection and Feature Extraction Techniquesmentioning
confidence: 99%
“…They reported an F1 measure of 0.525, by applying Maximum Entropy as their classifier on WebKB dataset. For classifying URLs, an n-gram based approach is followed by Rajalakshmi and Aravindan (2011) in which only 3-grams derived from URLs are used as the features. In this approach, the dimensionality of feature vector is restricted to a maximum of 26 3 features.…”
Section: Web Page Classificationmentioning
confidence: 99%
“…URL classification problem is studied by many researchers (Kan, 2004;Kan and Thi, 2005;Baykan et al, 2011;Rajalakshmi and Aravindan, 2011;Singh et al, 2012) and various URL features are suggested in the Science Publications JCS literature. Kan and Thi (2005) suggested segmentation techniques for extracting features from URLs.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Then, the multiclass decision is made by combining the decisions of all the binary classifiers, and the ties are broken at random when one or more binary classifiers say “yes“ or none of the classifiers say “yes.“ Therefore, it resulted in a poor multiclass performance. In our previous works and the work reported by Baykan et al, only URL features were used for classification, but no feature selection method was applied. In their research work, they suggested the use of all n ‐grams ( n =4 to 8) for a URL classification problem.…”
Section: Introductionmentioning
confidence: 99%