Proceedings of the 17th International Conference on Information Integration and Web-Based Applications &Amp; Services 2015
DOI: 10.1145/2837185.2837208
|View full text |Cite
|
Sign up to set email alerts
|

Towards complete coverage in focused web harvesting

Abstract: With the goal of harvesting all information about a given entity, in this paper, we try to harvest all matching documents for a given query submitted on a search engine. The objective is to retrieve all information about for instance "Michael Jackson", "Islamic State", or "FC Barcelona" from indexed data in search engines, or hidden data behind web forms, using a minimum number of queries. Policies of web search engines usually do not allow accessing all of the matching query search results for a given query. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2016
2016
2016
2016

Publication Types

Select...
2

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 14 publications
0
7
0
Order By: Relevance
“…Web content changes rapidly [95,97]. In Focused Web Harvesting [84] which aim it is to achieve a complete harvest for a given topic, this dynamic nature of the web creates problems for users who need to access a set of all the relevant web data to their topics of interest. Whether you are a fan following your favorite idol or a journalist investigating a topic, you may need not only to access all the relevant information but also the recent changes and updates.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Web content changes rapidly [95,97]. In Focused Web Harvesting [84] which aim it is to achieve a complete harvest for a given topic, this dynamic nature of the web creates problems for users who need to access a set of all the relevant web data to their topics of interest. Whether you are a fan following your favorite idol or a journalist investigating a topic, you may need not only to access all the relevant information but also the recent changes and updates.…”
Section: Discussionmentioning
confidence: 99%
“…Surfacing approaches try to cover all the topics in a website. However, in focused web harvesting [84,86], harvesters focus on extracting all relevant information to a given query, topic or entity.…”
Section: Focused Web Harvestingmentioning
confidence: 99%
See 3 more Smart Citations