2008
DOI: 10.1007/s00778-008-0119-9
|View full text |Cite
|
Sign up to set email alerts
|

Data integration with uncertainty

Abstract: This paper reports our first set of results on managing uncertainty in data integration. We posit that data-integration systems need to handle uncertainty at three levels, and do so in a principled fashion. First, the semantic mappings between the data sources and the mediated schema may be approximate because there may be too many of them to be created and maintained or because in some domains (e.g., bioinformatics) it is not clear what the mappings should be. Second, queries to the system may be posed with k… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
222
0
2

Year Published

2008
2008
2022
2022

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 220 publications
(224 citation statements)
references
References 23 publications
(23 reference statements)
0
222
0
2
Order By: Relevance
“…PayGo supports keyword searches but reformulates them into queries that attempt to identify relevant sources using the clusters previously built. UDI [17,23,24,60] is a dataspace proposal for integration of a large number of domain independent data sources automatically. In contrast to the proposals introduced so far, which either start with a manually defined integration schema or use the union of all source schemas as integration schema, UDI aims to derive a merged integration schema automatically, consolidating schema and instance references.…”
Section: Dataspace Management Systemsmentioning
confidence: 99%
“…PayGo supports keyword searches but reformulates them into queries that attempt to identify relevant sources using the clusters previously built. UDI [17,23,24,60] is a dataspace proposal for integration of a large number of domain independent data sources automatically. In contrast to the proposals introduced so far, which either start with a manually defined integration schema or use the union of all source schemas as integration schema, UDI aims to derive a merged integration schema automatically, consolidating schema and instance references.…”
Section: Dataspace Management Systemsmentioning
confidence: 99%
“…The issue of dealing with uncertainty in ontology matching has been addressed in [8,16,28,29,53,63]. A way of modeling ontology matching as an uncertain process is by using similarity matrices as a measure of certainty.…”
Section: Uncertainty In Ontology Matchingmentioning
confidence: 99%
“…In [28], uncertainty is refined by a comparison of K alignments, each with its own uncertainty measure (modeled as a fuzzy relation over the two ontologies) in order to improve precision of the matching results. Finally, the work in [16] introduced the notion of probabilistic schema mappings (correspondences), namely a set of mappings with a probability attached to each mapping; and, used it to answer queries with uncertainty about semi-automatically created mappings. Imprecise mappings can be further improved over time as deemed necessary, for example, within the settings of approximate data integration, see, e.g., [72].…”
Section: Uncertainty In Ontology Matchingmentioning
confidence: 99%
“…For building such a system, we take advantage of the background knowledge which is implied in the functional dependencies (FDs) defined on the schemas. Since probabilistic data models have shown to be promising [3][4][5], we build our approach on a probabilistic data model to capture the uncertainty which arises during the schema matching process. Therefore, we generate a set of Probabilistic Mediated Schemas (PMSs).…”
Section: Introductionmentioning
confidence: 99%