2004
DOI: 10.1007/978-3-540-30217-9_110
|View full text |Cite
|
Sign up to set email alerts
|

Web Page Classification with an Ant Colony Algorithm

Abstract: Abstract. This paper utilizes Ant-Miner -the first Ant Colony algorithm for discovering classification rules -in the field of web content mining, and shows that it is more effective than C5.0 in two sets of BBC and Yahoo web pages used in our experiments. It also investigates the benefits and dangers of several linguistics-based text preprocessing techniques to reduce the large numbers of attributes associated with web content mining.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2005
2005
2010
2010

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 39 publications
(15 citation statements)
references
References 11 publications
0
15
0
Order By: Relevance
“…A number of techniques have been proposed based on extracted terms and textual information [2,3,4]. A set of words extracted from web documents are used as features for classification algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…A number of techniques have been proposed based on extracted terms and textual information [2,3,4]. A set of words extracted from web documents are used as features for classification algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…IDF TF × = weight system and the paper [18], in which the authors describe a web page classification system using an ant colony algorithm for classification but relying heavily on WordNet for processing of web pages.…”
Section: Wordnetmentioning
confidence: 99%
“…The algorithm implements the basic idea of awarding the best attributes (used by the ants to construct the best rules) with pheromone, which increases the probability of those attributes being selected by the next ants to construct other rules. A simple high-level pseudocode of Ant-Miner is shown in Pseudocode 1, adapted from [7]. A more detailed description of Ant-Miner can be found in [3].…”
Section: The Original Ant-miner Classification Algorithmmentioning
confidence: 99%
“…All attributes within these datasets are binary, where each attribute denotes whether or not a given word occurs in a given web page (example). These datasets have been collected by and previously been experimented with Ant-Miner by Holden & Freitas [7].…”
Section: Experimental Setup and Datasets Used In The Experimentsmentioning
confidence: 99%