Proceedings of the 2016 International Conference on Management of Data 2016
DOI: 10.1145/2882903.2903741
|View full text |Cite
|
Sign up to set email alerts
|

The Snowflake Elastic Data Warehouse

Abstract: We live in the golden age of distributed computing. Public cloud platforms now offer virtually unlimited compute and storage resources on demand. At the same time, the Software-as-a-Service (SaaS) model brings enterprise-class systems to users who previously could not afford such systems due to their cost and complexity. Alas, traditional data warehousing systems are struggling to fit into this new environment. For one thing, they have been designed for fixed resources and are thus unable to leverage the cloud… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
80
0
2

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 213 publications
(82 citation statements)
references
References 21 publications
0
80
0
2
Order By: Relevance
“…For example, authors used it to recommend projections in the Vertica column-store system. Snowflake [65] is an another commercial system capable of column-oriented data processing for semi-structured data. It adopts a Software-as-a-Service model and aims to free user from complex management tasks.…”
Section: Database Tuning For Column-storesmentioning
confidence: 99%
“…For example, authors used it to recommend projections in the Vertica column-store system. Snowflake [65] is an another commercial system capable of column-oriented data processing for semi-structured data. It adopts a Software-as-a-Service model and aims to free user from complex management tasks.…”
Section: Database Tuning For Column-storesmentioning
confidence: 99%
“…Nowadays, most analytics systems have adopted data skipping. Examples include Amazon Redshift [8], Google Powerdrill [1], Hive [43], IBM DB2 [40], Parquet [2], Vertica [11], Snowflake [16] and so on. Data skipping can be used to reduce data scan whether the underlying data layout is row-major or column-major.…”
Section: Related Workmentioning
confidence: 99%
“…Data skipping has become an essential mechanism for improving query performance in modern analytics databases (e.g., [1,8,16,26,40]) and the Hadoop ecosystem (e.g., [2,43]). In these systems, data are organized into blocks, each of which typically contains tens of thousands of tuples.…”
Section: Introductionmentioning
confidence: 99%
“…Relational data systems have been increasingly adopting LSM-style of updates. My-Rocks [28] uses RocksDB as storage engine and SQLite4 [62] has experimented with LSM-trees in its storage layer, while columnar systems use LSM-style updates [20,44,63,70].…”
Section: Introductionmentioning
confidence: 99%