2020
DOI: 10.14778/3389133.3389135
|View full text |Cite
|
Sign up to set email alerts
|

The PGM-index

Abstract: We present the first learned index that supports predecessor, range queries and updates within provably efficient time and space bounds in the worst case. In the (static) context of just predecessor and range queries these bounds turn out to be optimal. We call this learned index the Piecewise Geometric Model index (PGM-index). Its flexible design allows us to introduce three variants which are novel in the context of learned data structures. The first variant of the PGM-index is able to adapt itself to the di… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
29
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 161 publications
(30 citation statements)
references
References 29 publications
1
29
0
Order By: Relevance
“…Recursive model indexes (RMIs) are one such class of models [8] (although others [3][4][5]11] exist as well) , combining simpler machine learning models together into a multistaged structure. For example, as depicted in Figure 1, an RMI with two stages, a linear stage and a cubic stage, would first use a linear model to make an initial prediction of an index for a specific key (stage 1).…”
Section: Introductionmentioning
confidence: 99%
“…Recursive model indexes (RMIs) are one such class of models [8] (although others [3][4][5]11] exist as well) , combining simpler machine learning models together into a multistaged structure. For example, as depicted in Figure 1, an RMI with two stages, a linear stage and a cubic stage, would first use a linear model to make an initial prediction of an index for a specific key (stage 1).…”
Section: Introductionmentioning
confidence: 99%
“…The motivation is that an index can be seen as a function mapping a search key to the storage position of the corresponding record. Several follow-up studies propose learned indexes for one-dimensional data [17,20,69]. More details can be found in a benchmark study [41].…”
Section: Training Timementioning
confidence: 99%
“…IFB-tree [37] evaluates the update cost with interpolation-friendliness, such as a partition in uniform distribution with higher interpolation-friendliness. PGM-index [38] admits a streaming algorithm to partition, instead of using FITing-tree's greedy algorithm, and handles updates using LSM-tree. Shift-table [39] resolves the local biases of learned models at the cost of (at most) one memory lookup.…”
Section: Learned Indicesmentioning
confidence: 99%