2006
DOI: 10.1007/11890850_17
|View full text |Cite
|
Sign up to set email alerts
|

A Provenance Model for Manually Curated Data

Abstract: Abstract. Many curated databases are constructed by scientists integrating various existing data sources "by hand", that is, by manually entering or copying data from other sources. Capturing provenance in such an environment is a challenging problem, requiring a good model of the process of curation. Existing models of provenance focus on queries/views in databases or computations on the Grid, not updates of databases or Web sites. In this paper we motivate and present a simple model of provenance for manuall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0
5

Year Published

2006
2006
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(21 citation statements)
references
References 15 publications
0
16
0
5
Order By: Relevance
“…The importance of provenance information in scientific data and workflow management is widely recognized, as witnessed, e.g., by specialized workshops [4,1], research projects [17], and surveys [3,20] dedicated to this topic, and by investigations on foundations of data provenance for queries and transformations [5,9,23]. However, current scientific workflow systems still offer little or no support for queries of interest to the end-users of these systems, e.g., researchers in the life or physical sciences.…”
Section: Introductionmentioning
confidence: 99%
“…The importance of provenance information in scientific data and workflow management is widely recognized, as witnessed, e.g., by specialized workshops [4,1], research projects [17], and surveys [3,20] dedicated to this topic, and by investigations on foundations of data provenance for queries and transformations [5,9,23]. However, current scientific workflow systems still offer little or no support for queries of interest to the end-users of these systems, e.g., researchers in the life or physical sciences.…”
Section: Introductionmentioning
confidence: 99%
“…2. Extensibility: Support for the addition of domain specific provenance information (requirement based on the multiplicity of provenance models and applications expressed in [12,13,[15][16][17][18][19][20][21][22], use case III, quality dimensions 2, 3, 5, 6, 11, 12).…”
Section: Requirements For a Provenance Model For The Webmentioning
confidence: 99%
“…Examples of such systems include operating system [218][159], desktop [265], statistical packages [143,21], databases [59,406,176], data warehouses [104], and workflow systems (Kepler [39,41], Taverna [419], VisTrails [156], VDS [424], Pegasus [278]). Other approaches allow for varying degrees of open-ness: curated databases allow for user-edits [52], PASOA allows for service-oriented architectures [195] where the number of components can dynamically change and is not known at design-time, or some allow for data to be published on the Web, with data and provenance both accessible by simple browser navigation [447]. Others even aim at tracking the provenance of objects in the physical world [231].…”
Section: Assumptionsmentioning
confidence: 99%
“…Buneman et al [51,52] assume that changes to the database can be modeled using transactions comprising simple "copy-paste" updates. They derive a model of provenance that captures such update sequence, which allows them to reconstitute changes to the database.…”
Section: Broadening the Scope Of Provenance Beyondmentioning
confidence: 99%