2017
DOI: 10.1007/978-3-319-68288-4_19
|View full text |Cite
|
Sign up to set email alerts
|

Challenges of Source Selection in the WoD

Abstract: Federated querying, the idea to execute queries over several distributed knowledge bases, lies at the core of the semantic web vision. To accommodate this vision, SPARQL provides the SERVICE keyword that allows one to allocate sub-queries to servers. In many cases, however, data may be available from multiple sources resulting in a combinatorially growing number of alternative allocations of subqueries to sources. Running a federated query on all possible sources might not be very lucrative from a user's point… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 25 publications
0
6
0
Order By: Relevance
“…In particular, the marketplace would have to estimate how many triples have to be accessed to calculate the price of a specific allocation prior to query execution. However, as we have shown in [15], such estimations can be very unreliable. A high discrepancy between estimated price and actual price can be very devastating for such a marketplace.…”
Section: Providermentioning
confidence: 91%
See 1 more Smart Citation
“…In particular, the marketplace would have to estimate how many triples have to be accessed to calculate the price of a specific allocation prior to query execution. However, as we have shown in [15], such estimations can be very unreliable. A high discrepancy between estimated price and actual price can be very devastating for such a marketplace.…”
Section: Providermentioning
confidence: 91%
“…This decision, in turn, depends on how much the bought data can contribute to a specific query. As we showed in [15], it is very difficult to decide before query execution how much a certain source can contribute to a query answer. However, after query execution it is too late to decide against the inclusion of some sources, as the data is already bought.…”
Section: Introductionmentioning
confidence: 99%
“…Hence, a customer might either waste money on triples which do not join or miss a part of the query answer because some triples were not bought. As we have shown in [8], join estimation techniques are suffering a lot from false-positive matches in the WoD setting and hence, we cannot expect that a customer would be able to use such techniques to buy exactly those triples which are needed to form a specific query answer. Only a query execution can reveal the true contribution and value of the publishers' triples.…”
Section: A Marketplace For Commercial Datamentioning
confidence: 99%
“…Afterwards, the federated query processing contains the following steps: "Query Parsing, Data Source Selection, Query Optimization and Query Execution". Query Parsing is the process of parsing and transforming a given query expressed by using SPARQL query language into a query execution tree [186], while Data Source Selection is used for finding the relevant datasets (i.e., SPARQL endpoints) for a triple (or a set of triples) pattern of a given SPARQL query [108]. By having selected the datasets for a given query, Query Optimization process starts for placing the triple patterns into groups, and it is used for determining in an efficient way the order of joins and triple patterns.…”
Section: Mediator (Virtual Integration)mentioning
confidence: 99%
“…As regards Catalog/index-free approaches, FedX [223] system sends one SPARQL ASK query per triple pattern to the federated datasets' endpoints for detecting relevant datasets, e.g., for a triple pattern {?s foaf:name ?name}, it send the following query to each dataset D i : "ASK {?s foaf:name ?name}", which returns true if D i contains triples with this property. Finally, for decreasing the execution time of source selection task, query approximation techniques have been proposed [108].…”
Section: Contextmentioning
confidence: 99%