2016
DOI: 10.1016/j.csi.2015.07.001
|View full text |Cite
|
Sign up to set email alerts
|

A focused crawler combinatory link and content model based on T-Graph principles

Abstract: Abstract-The two significant tasks of a focused Web crawler are finding relevant topic-specific documents on the Web and analytically prioritizing them for later effective and reliable download. For the first task, we propose a sophisticated custom algorithm to fetch and analyze the most effective HTML structural elements of the page as well as the topical boundary and anchor text of each unvisited link, based on which the topical focus of an unvisited page can be predicted and elicited with a high accuracy. T… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…1 illustrates the framework architecture of a Treasure-Crawler-based search engine and its modules which satisfy all the functional and non-functional requirements. The details of this architecture are giv-en in an under-review paper titled as: "A Focused Crawler Combinatory Link and Content Model Based on T-Graph Principles" [1]. These modules are designed in a way to have the ability of being plugged and played while requiring minimum changes in other modules or the adjacent module interfaces.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…1 illustrates the framework architecture of a Treasure-Crawler-based search engine and its modules which satisfy all the functional and non-functional requirements. The details of this architecture are giv-en in an under-review paper titled as: "A Focused Crawler Combinatory Link and Content Model Based on T-Graph Principles" [1]. These modules are designed in a way to have the ability of being plugged and played while requiring minimum changes in other modules or the adjacent module interfaces.…”
Section: Methodsmentioning
confidence: 99%
“…[8] In addition to the above procedure, the T-Graph structure as an exemplary guide carries out the task of priority association by providing a conceptual route for the crawler to follow and find on-topic regions. This phase is elaborated in [1].…”
Section: Watchdogmentioning
confidence: 99%
See 1 more Smart Citation
“…Seyfi et al [11,12] proposed a focused crawler by using T-graph principles. This work gives solution to two problems in the focused crawler platform.…”
Section: Vsm Crawler or Classic Focused Crawlermentioning
confidence: 99%