2017
DOI: 10.1016/j.procs.2017.09.128
|View full text |Cite
|
Sign up to set email alerts
|

Experimental Study Of Feature Weighting Techniques For URL Based Webpage Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(5 citation statements)
references
References 13 publications
0
5
0
Order By: Relevance
“…Rajalakshmi and Xavier [1] described Web page classification using URL based features. They have used classification steps namely preprocessing, feature weighting methods (n-Gram, Term Frequency and Mutual Information), training (70%) and testing samples (30%) of WebKB dataset.…”
Section: Literature Overviewmentioning
confidence: 99%
“…Rajalakshmi and Xavier [1] described Web page classification using URL based features. They have used classification steps namely preprocessing, feature weighting methods (n-Gram, Term Frequency and Mutual Information), training (70%) and testing samples (30%) of WebKB dataset.…”
Section: Literature Overviewmentioning
confidence: 99%
“…Using statistically relevant tokens as features, we classified the URLs in a multiclass scenario. The number of token features present in each statistical dictionary SDC k for all 13 Naive Bayes algorithm, the exact token matches with the SDC k are assigned with the term goodness of the corresponding tokens as the weights ignoring the partial matches. Table 5 shows the results of this experiment.…”
Section: Experiments To Ascertain the Importance Of Feature Selectionmentioning
confidence: 99%
“…Some URLs may not contain any related information about a web page, and hence pose more challenges for classification (eg, http://www.clker.com/). URL classification problem has been studied by various researchers, [6][7][8][9][10][11][12][13][14][15] and various methods have been suggested in the literature. The issues in these existing methods are discussed below.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Zhao et al [9] used the Naive Bayes algorithm and SVM to classify RESTful services by extracting features from web page content and document structure information. Rajalakshmi and Xaviar [16] conducted an experimental study on URL-based web page classification feature-weighting techniques and used SVM to perform web page classification. Altay et al [17] used context-sensitive and keyword density to extract features and used SVM, maximum entropy, and Extreme learning machine for malicious web page detection.…”
Section: Related Workmentioning
confidence: 99%