Proceedings of the 2018 International Conference on Management of Data 2018
DOI: 10.1145/3183713.3196909
|View full text |Cite
|
Sign up to set email alerts
|

The Case for Learned Index Structures

Abstract: Indexes are models: a B-Tree-Index can be seen as a model to map a key to the position of a record within a sorted array, a Hash-Index as a model to map a key to a position of a record within an unsorted array, and a BitMap-Index as a model to indicate if a data record exists or not. In this exploratory research paper, we start from this premise and posit that all existing index structures can be replaced with other types of models, including deep-learning models, which we term learned indexes. We theoreticall… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
620
0
4

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 677 publications
(675 citation statements)
references
References 59 publications
4
620
0
4
Order By: Relevance
“…In addition, a recent paper featuring learned indexes [24] discusses the cases of using complex machine learning models such as neural networks and multivariate regression models to predict locations of keys. As opposed to learned indexes, Hermit models the correlation between two columns and leverages the curve-fitting technique to adaptively create simple yet customized ML models for different regions (TRS-Tree tree nodes).…”
Section: D3 Complex Machine Learning Modelsmentioning
confidence: 99%
“…In addition, a recent paper featuring learned indexes [24] discusses the cases of using complex machine learning models such as neural networks and multivariate regression models to predict locations of keys. As opposed to learned indexes, Hermit models the correlation between two columns and leverages the curve-fitting technique to adaptively create simple yet customized ML models for different regions (TRS-Tree tree nodes).…”
Section: D3 Complex Machine Learning Modelsmentioning
confidence: 99%
“…For example, the nearest neighbor interpolation of a point is equivalent to allocating indices of one to its neighbor and then map the value of the point. In this sense, indices are models [24], therefore indices can be modeled and learned. In this work, we model indices as a function of the local feature map and learn an index function to perform upsampling within deep CNNs.…”
Section: Introductionmentioning
confidence: 99%
“…The performance of the algorithm should be bounded as a function of some measure of the oracle error, even though the algorithm is oblivious to this error. The ML advice model has in the past been applied to the ski rental problem [13,4], job scheduling [13,12] and online revenue maximization [11]; it has also been used to achieve theoretical and practical gains in streaming frequency estimation [5] and data structures [7]. Most relevant to this paper is prior work by [8] in which it was shown how the model can be applied to the online caching problem.…”
Section: Introductionmentioning
confidence: 99%