2016 5th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO) 2016
DOI: 10.1109/icrito.2016.7785024
|View full text |Cite
|
Sign up to set email alerts
|

Self-adaptive ontology-based focused crawling: A literature survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 22 publications
0
3
0
Order By: Relevance
“…The main role of the web crawler is to simplify and to automate the entire crawling process and makes the data crawling easy and accessible to everyone. According to the [5], crawlers are classified into two main categories: classical (traditional) crawlers and focused crawlers. The classical (traditional) web crawlers navigate the webpages and gather both relevant and irrelevant information which is a huge waste of crawling time and the storage of the downloaded information [6] Focused crawlers do not crawl the whole web as opposed to the traditional crawlers, as they only crawl to the deepest the specific part of the web that is related to the given topic.…”
Section: Data Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…The main role of the web crawler is to simplify and to automate the entire crawling process and makes the data crawling easy and accessible to everyone. According to the [5], crawlers are classified into two main categories: classical (traditional) crawlers and focused crawlers. The classical (traditional) web crawlers navigate the webpages and gather both relevant and irrelevant information which is a huge waste of crawling time and the storage of the downloaded information [6] Focused crawlers do not crawl the whole web as opposed to the traditional crawlers, as they only crawl to the deepest the specific part of the web that is related to the given topic.…”
Section: Data Extractionmentioning
confidence: 99%
“…Financial banking institution (4). Financial banking product (5). Characteristics that have been evaluated by the users (6).…”
Section: Data Storagementioning
confidence: 99%
“…Because of this, traditional crawlers extract a large amount of data and often a big part of it proves to be irrelevant to users. [13] On the other hand, topic crawlers are agents that collect web pages that satisfy certain specific properties. They offer the possibility of downloading relevant web documents for a predefined domain, providing the most up-todate resources (web pages) relevant to the needs of users, with minimum consumption of resources such as storage, time and network bandwidth.…”
Section: Web Data Extractionmentioning
confidence: 99%