2016
DOI: 10.1007/978-3-319-41576-5_12
|View full text |Cite
|
Sign up to set email alerts
|

S2X: Graph-Parallel Querying of RDF with GraphX

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
37
0
1

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 52 publications
(38 citation statements)
references
References 5 publications
0
37
0
1
Order By: Relevance
“…The results of this experiment include a comparative evaluation of our method against four state-of-the-art public diskbased distributed RDF systems proposed in the most recent three years, including DREAM [7], S2X [19], S2RDF [20], and CliqueSquare [4], which are provided by [1]. Other distributed RDF systems in the most recent three years are either unreleased, or are memory-based systems that are in different environments than targeted in this study.…”
Section: F Online Performance Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…The results of this experiment include a comparative evaluation of our method against four state-of-the-art public diskbased distributed RDF systems proposed in the most recent three years, including DREAM [7], S2X [19], S2RDF [20], and CliqueSquare [4], which are provided by [1]. Other distributed RDF systems in the most recent three years are either unreleased, or are memory-based systems that are in different environments than targeted in this study.…”
Section: F Online Performance Comparisonmentioning
confidence: 99%
“…First, some recent works (e.g., [4], [20], [19]) focus on managing RDF datasets using cloud platforms. CliqueSquare [4] discusses how to build query plans by relying on n-ary (star) equality joins in Hadoop.…”
Section: Related Workmentioning
confidence: 99%
“…S2X [51] exploits the inherited graph structure of RDF to process SPARQL as graph-based computations on top of GraphX. It uses the parallel vertex-centric model to evaluate the BGP matching of SPARQL while other operators, such as OPTIONAL and FILTER, are processed through Spark RDD operators.…”
Section: Mapreduce and Graph Based Systemsmentioning
confidence: 99%
“…Each SPARQL query is decomposed into multiple subqueries, which are then evaluated independently. Since the data is [46] Subject Hash Distributed Semi-Join CliqueSquare [25] Hybrid (Hash + VP) MapReduce-based Join DREAM [38] No partitioning; full replication RDF-3X [53] EAGRE [56] METIS MapReduce-based Join gStoreD [45] Partitioning Agnostic gStore [37] H-RDF-3X [29] METIS RDF-3X [53] H2RDF+ [41] H-Base partitioner (range) Centralized + MapReduce HadoopRDF [30] VP + predicate files on HDFS MapReduce Join Partout [36] Workload-based fragmentation RDF-3X [53] PigSparql [14] Hash + Triple-based files SPARQL to PigLatin S2RDF [15] Extended Vertical Partitioning SPARQL to SQL S2X [51] GraphX partitioning strategy Vertex-Centric BGP matching Sedge [57] Subject Hash Vertex-Centric BGP matching Sempala [50] VP SPARQL to SQL SHAPE [32] Semantic Hash Partitioning RDF-3X [53] SHARD [47] Hash MapReduce-based Join TriAD [48] Hash-based Sharding Distributed Merge/Hash Joins TriAD-SG [48] METIS + Horizontal Sharding Distributed Merge/Hash Joins Trinity.RDF [33] Key-value store on graph Graph Exploration WARP [28] METIS on query workload RDF-3X [53] In this survey, we categorize distributed RDF management systems along 2 dimensions based on their execution model: (i) MapReduce and Graph-based systems: such systems rely on general purpose frameworks, i.e., Hadoop or Spark, that offer seamless data distribution and parallelization at the cost of flexibility. (ii) Specialized RDF systems: are built specifically for SPARQL query evaluation by utilizing custom physical layouts, native RDF indexing, efficient communication protocols and explicit replication.…”
Section: Distributed Rdf Systemsmentioning
confidence: 99%
See 1 more Smart Citation