Proceedings of the 16th International Conference on Extending Database Technology 2013
DOI: 10.1145/2452376.2452390
|View full text |Cite
|
Sign up to set email alerts
|

Processing multi-way spatial joins on map-reduce

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0
4

Year Published

2014
2014
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(23 citation statements)
references
References 21 publications
0
19
0
4
Order By: Relevance
“…Unlike the naïve approaches discussed in [12], the cascaded pairwise spatial join in MSJS is efficient mainly because the disk I/O in Spark is much smaller than that in MapReduce. The series of pairwise spatial joins in MSJS do not perform as a number of map and reduce tasks but rather as a series of transactions in Spark that are executed in memory.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Unlike the naïve approaches discussed in [12], the cascaded pairwise spatial join in MSJS is efficient mainly because the disk I/O in Spark is much smaller than that in MapReduce. The series of pairwise spatial joins in MSJS do not perform as a number of map and reduce tasks but rather as a series of transactions in Spark that are executed in memory.…”
Section: Methodsmentioning
confidence: 99%
“…Gupta et al developed a Controlled-Replicate framework coupled with the project-split-replicate notation to handle multiway spatial join queries [12]. Controlled-Replicate runs as a cycle of two MapReduce jobs.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Since the practical method to efficiently query against big spatial data is to employ the divide and conquer strategy [9,26], most MapReduce-based PSQPAs use certain types of space filling curves, such as Hilbert space-filling curve, to map MBRs to grids based on the spatial correlation for optimizing efficiency [27,28]. We simply treat the number of grids p as one of the internal parameters of Spark-based PSQPAs.…”
Section: Identifying Factors Impacting the Efficiency Of Spark-based mentioning
confidence: 99%
“…The basic units composing this complexity are multiway joins. Although joins are unavoidable and time-consuming, the projection operation mapping spatial correlated datasets into the same grids is commonly used in the filter and refinement stages [28]. From the viewpoint of Spark CM, the number of grids determines the number of tasks that should be executed, which can directly impact the efficiency of the PSQPAs.…”
Section: Identifying Factors Impacting the Efficiency Of Spark-based mentioning
confidence: 99%
“…The loose Octree [30] allows for a degree of imprecision so that objects can be assigned to lower levels when they intersect only slightly with a cell. The idea of using grids to parallelize the join has also been optimized for GPUs [33] as well as on a larger scale on the MapReduce framework [12,23].THERMAL-JOIN as presented here is single threaded but can be parallelized like the aforementioned approaches.…”
Section: Iterative Static Spatial Joinmentioning
confidence: 99%