2010
DOI: 10.1007/978-3-642-17746-0_27
|View full text |Cite
|
Sign up to set email alerts
|

SPARQL Query Optimization on Top of DHTs

Abstract: Abstract. We study the problem of SPARQL query optimization on top of distributed hash tables. Existing works on SPARQL query processing in such environments have never been implemented in a real system, or do not utilize any optimization techniques and thus exhibit poor performance. Our goal in this paper is to propose efficient and scalable algorithms for optimizing SPARQL basic graph pattern queries. We augment a known distributed query processing algorithm with query optimization strategies that improve pe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
35
0

Year Published

2011
2011
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 40 publications
(35 citation statements)
references
References 27 publications
0
35
0
Order By: Relevance
“…Similar to our work, [52] recently proposed databaseoriented query optimization strategies for RDF query processing on top of DHTs in the context of the Atlas system [53]. While one of our main goals is the optimization of the number of messages required for processing a query, in that work the authors explicitly focus on reducing the required bandwidth.…”
Section: Related Workmentioning
confidence: 97%
See 1 more Smart Citation
“…Similar to our work, [52] recently proposed databaseoriented query optimization strategies for RDF query processing on top of DHTs in the context of the Atlas system [53]. While one of our main goals is the optimization of the number of messages required for processing a query, in that work the authors explicitly focus on reducing the required bandwidth.…”
Section: Related Workmentioning
confidence: 97%
“…In contrast, UniStore focuses on less reliable systems and proposes to make extensive use of parallel processing strategies. [52] discusses very interesting and important extensions for our system, while on the other hand the Atlas system can benefit from the different processing strategies we discuss. An evaluation in a large local cluster of powerful machines shows that the idea of efficient RDF query processing in DHT systems can scale to millions of triples.…”
Section: Related Workmentioning
confidence: 99%
“…A popular data partitioning algorithm for RDF data is hash partitioning [13], [14], [18]. This approach distributes RDF triples across different partitions by computing a hash key over the subject or the object of each triple.…”
Section: Related Workmentioning
confidence: 99%
“…A popular approach to partition RDF data is hash partitioning, which is adopted by a majority of the existing distributed RDF engines [13], [14], [18], [24]. This approach distributes RDF triples across different partitions by computing a hash key over either the subject or the object of each triple.…”
Section: Introduction Rdf (Resource Description Framework)mentioning
confidence: 99%
“…The bound-is-easier selection function is commonly used in recursive query evaluation, where the atom with the largest number of constants is evaluated first, in the hope of returning the smallest intermediate relation [14,16]. One can find extensions of this selection function in the literature, such as in [9] for a Semantic Web setting with binary predicates. There, the position of the bound argument is considered, where atoms with a bound first argument (subject) are preferred over those with a bound second argument (object) for two arguments with the same number of bindings.…”
Section: Sub-query Schedulingmentioning
confidence: 99%