International audienceThis work concerns query processing to support data sharing in large scale Virtual Organizations(VO). Characterization of VO's data sharing contexts reflects the coexistence of factors like sources overlapping, uncertain data location, and fuzzy copies in dynamic large scale environments that hinder query processing. Existing results on distributed query evaluation are useful for VOs, but there is no appropriate solution combining high semantic level and dynamic large scale environments required by VOs. This paper proposes a characterization of VOs data sources, called Data Profile, and a query processing strategy (called QPro2e) for large scale VOs with complex data profiles. QPro2e uses an evolving distributed knowledge base describing data sources roles w.r.t shared domain concepts. It allows the identification of logical data source clusters which improve query evaluation in presence of a very large number of data sources
This paper proposes to apply Multiple Kernel Learning and Indefinite Kernels (IK) to combine and tune Similarity Measures within the context of Ontology Instance Matching. We explain why MKL can be used in parameter selection and similarity measure combination; argue that IK theory is required in order to use MKL within this context; propose a configuration that makes use of both concepts; and present, using the IIMB bechmark, results of a prototype to show the feasibility of this idea in comparison with other matching tools.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.