The Case for Learned Spatial Indexes

Pandey, Varun; van Renen, Alexander; Kipf, Andreas; Sabek, Ibrahim; Ding, Jialin; Kemper, Alfons

doi:10.48550/arxiv.2008.10349

Cited by 2 publications

(2 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A common approach is to map 2D cells into a 1D domain by enumerating them with a space-filling curve such as the Hilbert or Z curve. As we will show, we can achieve much higher lookup performance with linearized cells, even compared to well-tuned 2D spatial indexes [15]. Polygon Indexing.…”

Section: Data Accessmentioning

confidence: 94%

“…In our experiment, we use 39,200 polygons corresponding to the NYC Census regions (query polygons) and 1.2B points from the NYC taxi data set (years 2009 to 2016) [20]. We implemented the kd-tree, Quadtree, and STR-packed R-tree baselines based on recent research [15]. For the Boost R * -tree, we chose the bulk-loading Figure 4(a) shows the cumulative query time to find the total number of points inside the query polygons, while varying the precision of the raster approximation (i.e., number of approximating cells per query polygon).…”

Section: Data Accessmentioning

confidence: 99%

See 1 more Smart Citation

The Case for Distance-Bounded Spatial Approximations

Zacharatou,

Kipf,

Sabek

et al. 2020

Preprint

Self Cite

View full text Add to dashboard Cite

Spatial approximations have been traditionally used in spatial databases to accelerate the processing of complex geometric operations. However, approximations are typically only used in a first filtering step to determine a set of candidate spatial objects that may fulfill the query condition. To provide accurate results, the exact geometries of the candidate objects are tested against the query condition, which is typically an expensive operation. Nevertheless, many emerging applications (e.g., visualization tools) require interactive responses, while only needing approximate results. Besides, real-world geospatial data is inherently imprecise, which makes exact data processing unnecessary. Given the uncertainty associated with spatial data and the relaxed precision requirements of many applications, this vision paper advocates for approximate spatial data processing techniques that omit exact geometric tests and provide final answers solely on the basis of (fine-grained) approximations. Thanks to recent hardware advances, this vision can be realized today. Furthermore, our approximate techniques employ a distance-based error bound, i.e., a bound on the maximum spatial distance between false (or missing) and exact results which is crucial for meaningful analyses. This bound allows to control the precision of the approximation and trade accuracy for performance.

show abstract

Section: Data Accessmentioning

confidence: 94%

Section: Data Accessmentioning

confidence: 99%