Given the maturity of the data integration field, it is surprising that rigorous empirical evaluations of research ideas are so scarce. We identify one major roadblock for empirical work: the lack of comprehensive metadata generators that can be used to create benchmarks for different integration tasks. This gap makes it difficult to compare integration solutions and to understand their generality and performance. We present iBench, the first metadata generator that can be used to evaluate a wide range of integration tasks (data exchange, mapping creation, mapping composition, and schema evolution, among many others). iBench permits control over the size and characteristics of the metadata it generates (schemas, constraints, and mappings). We show that iBench can be used to create very large, complex, yet realistic scenarios, and our evaluation demonstrates that it generates large scenarios with different characteristics efficiently. We also present an evaluation of two mapping creation systems using iBench and show that the fine-grained control iBench provides over metadata scenarios can reveal new and important empirical insights into integration solutions. iBench is an open-source, extensible tool that we are providing to the community. We believe it will raise the bar for empirical evaluation and comparison of data integration systems.