2020
DOI: 10.48550/arxiv.2008.10349
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

The Case for Learned Spatial Indexes

Varun Pandey,
Alexander van Renen,
Andreas Kipf
et al.

Abstract: Spatial data is ubiquitous. Massive amounts of data are generated every day from billions of GPS-enabled devices such as cell phones, cars, sensors, and various consumerbased applications such as Uber, Tinder, location-tagged posts in Facebook, Twitter, Instagram, etc. This exponential growth in spatial data has led the research community to focus on building systems and applications that can process spatial data efficiently. In the meantime, recent research has introduced learned index structures. In this wor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…A common approach is to map 2D cells into a 1D domain by enumerating them with a space-filling curve such as the Hilbert or Z curve. As we will show, we can achieve much higher lookup performance with linearized cells, even compared to well-tuned 2D spatial indexes [15]. Polygon Indexing.…”
Section: Data Accessmentioning
confidence: 94%
See 1 more Smart Citation
“…A common approach is to map 2D cells into a 1D domain by enumerating them with a space-filling curve such as the Hilbert or Z curve. As we will show, we can achieve much higher lookup performance with linearized cells, even compared to well-tuned 2D spatial indexes [15]. Polygon Indexing.…”
Section: Data Accessmentioning
confidence: 94%
“…In our experiment, we use 39,200 polygons corresponding to the NYC Census regions (query polygons) and 1.2B points from the NYC taxi data set (years 2009 to 2016) [20]. We implemented the kd-tree, Quadtree, and STR-packed R-tree baselines based on recent research [15]. For the Boost R * -tree, we chose the bulk-loading Figure 4(a) shows the cumulative query time to find the total number of points inside the query polygons, while varying the precision of the raster approximation (i.e., number of approximating cells per query polygon).…”
Section: Data Accessmentioning
confidence: 99%