2012
DOI: 10.1007/978-3-642-31753-8_33
|View full text |Cite
|
Sign up to set email alerts
|

Online Change Estimation Models for Dynamic Web Resources

Abstract: International audienceModern web 2.0 applications have transformed the Internet into an interactive, dynamic and alive information space. Personal weblogs, commercial web sites, news portals and social media applications generate highly dynamic information streams which have to be propagated to millions of users. This article focuses on the problem of estimating the publication frequency of highly dynamic web resources. We illustrate the importance of developing efficient online estimation techniques for impro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 16 publications
0
2
0
Order By: Relevance
“…In measurement literature, the majority of effort was spent on the behavior of web pages, including analysis of server logs [27], page-modification frequency during crawling [2], [4], [17], [24], RSS feed dynamics [34], and content change between consecutive observations [1], [14], [26]. Problems related to estimation of F U (x) have also emerged in prediction of future updates [5], [6], [13], [18], [31], [38], with a good survey in [25], and user lifetime measurement in decentralized P2P networks [3], [33], [37], [40].…”
Section: Related Workmentioning
confidence: 99%
“…In measurement literature, the majority of effort was spent on the behavior of web pages, including analysis of server logs [27], page-modification frequency during crawling [2], [4], [17], [24], RSS feed dynamics [34], and content change between consecutive observations [1], [14], [26]. Problems related to estimation of F U (x) have also emerged in prediction of future updates [5], [6], [13], [18], [31], [38], with a good survey in [25], and user lifetime measurement in decentralized P2P networks [3], [33], [37], [40].…”
Section: Related Workmentioning
confidence: 99%
“…To tackle both content and score dynamicity, a first solution consists in extending a content retrieval model (textual, spatial) and implementing adequate index structures and refresh strategies for low-latency and high throughput snapshot query evaluation. Best-effort refresh strategies [7,16] could determine optimal re-evaluation of active user queries which combined with real-time content indexing [3,2,31,22,19] can achieve high result freshness and completeness. However, real-time content indexing systems usually accommodate high arrival rates of items at the expense of result accuracy by either (a) excluding a significant portion of the incoming items (e.g.…”
Section: Introductionmentioning
confidence: 99%