2012
DOI: 10.1007/s00799-012-0094-z
|View full text |Cite
|
Sign up to set email alerts
|

Archiving the web using page changes patterns: a case study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2013
2013
2017
2017

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(19 citation statements)
references
References 33 publications
0
19
0
Order By: Relevance
“…Various efforts currently exist to capture the web at the local, national, international as well as institutional or community levels [50,62,78]. Studies of these efforts center around more efficient methods for capturing the dynamic web and increasing discoverability of the archived web by examining tools, policies, and metadata [19,38,67].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Various efforts currently exist to capture the web at the local, national, international as well as institutional or community levels [50,62,78]. Studies of these efforts center around more efficient methods for capturing the dynamic web and increasing discoverability of the archived web by examining tools, policies, and metadata [19,38,67].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Ben Saad and Gançarski performed a similar study regarding the importance of changes on a page [7][8][9]. Gray and Martin created a framework for high-quality mementos and assessing their quality by measuring the missing embedded resources [19].…”
Section: Related Workmentioning
confidence: 99%
“…8 When using a page-at-a-time archival service, the resulting memento contains embedded resources with the same archival datetime [1]. This section identifies our damage measurement of this page-at-a-time archiver and outlines the differences between Heritrix and WebCite.…”
Section: Measuring Webcitementioning
confidence: 99%
“…Ben Saad et al [13] claimed that using patterns is an effective way to predict changes, and then used this prediction to optimize the archiving process by crawling only important pages.…”
Section: Related Workmentioning
confidence: 99%