2012 IEEE 26th International Conference on Advanced Information Networking and Applications 2012
DOI: 10.1109/aina.2012.118
|View full text |Cite
|
Sign up to set email alerts
|

Web Spam Detection Using Link-Based Ant Colony Optimization

Abstract: Web spam is one of the most important problems which degrade quality and efficiency of web search engines. In this paper, we present a novel link-based ant colony optimization learning algorithm for spam host detection. The host graph is first constructed by aggregating pages' hyperlink structure. Following the TrustRank assumption, ants start walking from a normal host and randomly follow host links with a probability distribution. Then, the classification rules are appropriately generated according to common… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0
1

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 15 publications
0
7
0
1
Order By: Relevance
“…Gray Hat SEO is a transformation from White to Black and from Black to White. Usually, most of the companies are practicing the Gray Hat techniques to some extent for Search Engine Optimization due to the pressure from website owners to deliver excellent and quick results [64]. Moreover, they are not crossing the line to Black Hat SEO.…”
Section: Gray Hat Search Engine Optimizationmentioning
confidence: 99%
“…Gray Hat SEO is a transformation from White to Black and from Black to White. Usually, most of the companies are practicing the Gray Hat techniques to some extent for Search Engine Optimization due to the pressure from website owners to deliver excellent and quick results [64]. Moreover, they are not crossing the line to Black Hat SEO.…”
Section: Gray Hat Search Engine Optimizationmentioning
confidence: 99%
“…Link analysis is done by Apichat et al [3] using ant colony optimization in order to classify spam pages created using link spamming. Here the host graph is constructed by aggregating hyperlink structure of pages and ant starts walking from a normal host and randomly follows host links with probability distribution of TrustRank assumption.Yutak et.…”
Section: Related Workmentioning
confidence: 99%
“…But one variant is generated from an empty rule while the other is generated by greedily adding antecedents to the original rule. Moreover, the pruning metric used here is (3) Then the smallest possible DL for each variant and the original rule is computed. The variant with the minimal DL is selected as the final representative of Ri in the ruleset.…”
Section: Optimization Stagementioning
confidence: 99%
“…Abernethy simultaneously exploits the structure of the Web graph as well as page content features for web spam detection [9]. Taweesiriwate present a link-based ant colony optimization learning algorithm for spam host detection [10]. Following the TrustRank assumption, ants start walking from a normal host and randomly follow host links with a probability distribution.…”
Section: Introductionmentioning
confidence: 99%