2015
DOI: 10.1007/s00799-015-0150-6
|View full text |Cite
|
Sign up to set email alerts
|

Not all mementos are created equal: measuring the impact of missing resources

Abstract: Web archives do not always capture every resource on every page that they attempt to archive. This results in archived pages missing a portion of their embedded resources. These embedded resources have varying historic, utility, and importance values. The proportion of missing embedded resources does not provide an accurate measure of their impact on the Web page; some embedded resources are more important to the utility of a page than others. We propose a method to measure the relative value of embedded resou… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 35 publications
(10 citation statements)
references
References 28 publications
0
10
0
Order By: Relevance
“…This set of challenges was followed by lack of knowledge (i.e., education on preservation), and the complexity of objects. The complexity of objects includes challenges associated with objects that are composed of many layers (Broussard and Boss 2018) may be dependent on external resources such as APIs (Boss and Broussard 2017) or embedded resources (Brunelle et al 2015(Brunelle et al , 2016(Brunelle et al , 2017. Several studies noted that knowledge about which objects to preserve in the first place was lacking (Barwick, Dearnley, and Muir 2011;Broussard 2015a), or understanding how to deal with versions (i.e., which version of an object is definitive?…”
Section: Challenges Associated With Sustaining Access To Dynamic Data Visualisationsmentioning
confidence: 99%
See 1 more Smart Citation
“…This set of challenges was followed by lack of knowledge (i.e., education on preservation), and the complexity of objects. The complexity of objects includes challenges associated with objects that are composed of many layers (Broussard and Boss 2018) may be dependent on external resources such as APIs (Boss and Broussard 2017) or embedded resources (Brunelle et al 2015(Brunelle et al , 2016(Brunelle et al , 2017. Several studies noted that knowledge about which objects to preserve in the first place was lacking (Barwick, Dearnley, and Muir 2011;Broussard 2015a), or understanding how to deal with versions (i.e., which version of an object is definitive?…”
Section: Challenges Associated With Sustaining Access To Dynamic Data Visualisationsmentioning
confidence: 99%
“…This category includes recommendations related to the maintenance of authenticity and integrity of digital objects as they are being kept in long-term storage. It further includes recommendations to test web archives for missing embedded resources (Brunelle et al 2015(Brunelle et al , 2016Kelly, Nelson, and Weigle 2014) and the need for development of validation methods for software preservation (Barateiro et al 2012).…”
Section: Recommendations For Preservation In the Literaturementioning
confidence: 99%
“…Although the gold standard for assessing web archiving quality is still human interaction with a Memento to ensure all embedded resources, links, and functionality are preserved, this level of assessment is clearly not scalable. We have been involved with a range of automated evaluations of the web archiving process, including the Archival Acid Test (Kelly et al, 2014a), which evaluates the capabilities of crawling and playback technology stacks (e.g., the Heritrix crawler (Mohr et al, 2004) and the Wayback Machine playback engine), and assessing Memento damage (Brunelle et al, 2014(Brunelle et al, , 2015 which provides weights to missing embedded resources based on heuristics for determining if the missing resource was 'important'.…”
Section: Memento Quality and Temporal Coherencementioning
confidence: 99%
“…I focus here on studies that compare coverage between collections for a particular research topic or domain, as opposed to general quantitative evaluation of an archive's coverage compared to what exists on the live web (e.g. Ainsworth et al, 2011;Ainsworth et al, 2015;Brunelle et al, 2015). For example, Brügger (2013a) considers the coverage of material relating to Danish parliamentary elections by comparing historical network graphs available from the Danish Netarkivet collection and the Internet Archive.…”
Section: Challenge 2: Critically Examining Collected Materialsmentioning
confidence: 99%