Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data 2014
DOI: 10.1145/2588555.2595637
|View full text |Cite
|
Sign up to set email alerts
|

Orca

Abstract: The performance of analytical query processing in data management systems depends primarily on the capabilities of the system's query optimizer. Increased data volumes and heightened interest in processing complex analytical queries have prompted Pivotal to build a new query optimizer.In this paper we present the architecture of Orca, the new query optimizer for all Pivotal data management products, including Pivotal Greenplum Database and Pivotal HAWQ. Orca is a comprehensive development uniting state-of-thea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 52 publications
(5 citation statements)
references
References 21 publications
0
5
0
Order By: Relevance
“…SAP [5,19] for example reports that their core data services contain over 100 views that reference more than 100 tables, with the largest view referencing over 4000 tables. Similarly, we hear reports from Tableau [39] and VMware [37] about auto generated queries that are dozens of pages of SQL.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…SAP [5,19] for example reports that their core data services contain over 100 views that reference more than 100 tables, with the largest view referencing over 4000 tables. Similarly, we hear reports from Tableau [39] and VMware [37] about auto generated queries that are dozens of pages of SQL.…”
Section: Discussionmentioning
confidence: 99%
“…After the classical approach in System R [31], Goetz Graefe pioneered the implementation of optimizations on relational algebra with the EXODUS [10], Volcano [8], and Cascades [9] systems. Modern optimizers like Calcite [3] or Orca [37] still use the same concepts. These systems all rely on operator centric optimizations that transform the plan with predefined rules.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This proposal can be seen as an elegant adaptation of [39], proposed in a parallel database system, to a cloud system. More generally, with respect to the issue of query optimization in cloud environments, the most recent and relevant proposals are described in [11,41,53,61].…”
Section: Discussionmentioning
confidence: 99%
“…Hence, we consider Impala v1.3.1 as our main competitor. Since Impala's compiler supports only a subset of TPC-DS queries [21], the evaluation of Impala utilizes the Impala TPC-DS Kit from Cloudera for re-writing queries, available at https://github.com/cloudera/impala-tpcds-kit. In addition, we also compare OceanRT against Hive-on-Tez v0.12, and we have published the query re-writing toolkit for Hive at https://github.com/simonzhangsm/hive-testbench.…”
Section: Methodsmentioning
confidence: 99%