2020
DOI: 10.14778/3421424.3421425
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking learned indexes

Abstract: Recent advancements in learned index structures propose replacing existing index structures, like B-Trees, with approximate learned models. In this work, we present a unified benchmark that compares well-tuned implementations of three learned index structures against several state-of-the-art "traditional" baselines. Using four real-world datasets, we demonstrate that learned index structures can indeed outperform non-learned indexes in read-only in-memory workloads over a dense array. We investigate the impact… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
86
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
2

Relationship

3
5

Authors

Journals

citations
Cited by 106 publications
(90 citation statements)
references
References 21 publications
4
86
0
Order By: Relevance
“…We evaluate all join algorithms with real and synthetic datasets. For real datasets, we use three datasets from the SOSD benchmark [37], where we perform a self-join with each dataset. Any dataset is a list of unsigned 64-bit integer keys.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…We evaluate all join algorithms with real and synthetic datasets. For real datasets, we use three datasets from the SOSD benchmark [37], where we perform a self-join with each dataset. Any dataset is a list of unsigned 64-bit integer keys.…”
Section: Methodsmentioning
confidence: 99%
“…According to a recent benchmark study [37], both RMI and RadixSpline are considered the most efficient read-only learned indexing structures. Therefore, in this paper, we explore their usage in improving the performance of indexed nested loop joins as described later in Section 3.…”
Section: Cdf-based Partitioning and Learned Indexing Structuresmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, machine learning (ML) found its way into indexing. While learned indexes [26,24,32,25] may consume significantly less space than traditional indexes such as Btrees, that observation mostly applies to primary indexing where the base data is already sorted. In a secondary indexing setting such as ours, additional permutation vectors have to be maintained which will likely account for most of the space consumption of the learned index.…”
Section: Related Workmentioning
confidence: 99%
“…For example, such an index could be used for global dictionary encoding [6]. We introduce the RadixStringSpline (RSS), a learned string index consisting of a tree of the learned index structure RadixSpline (RS) [12,11,15]. RS is a learned index that consists of an error-bounded spline which is in turn indexed in a radix lookup table.…”
Section: Introductionmentioning
confidence: 99%