Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data 2014
DOI: 10.1145/2588555.2595639
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging compression in the tableau data engine

Abstract: Data sets are growing rapidly and there is an attendant need for tools that facilitate human analysis of them in a timely manner. To help meet this need, column-oriented databases (or "column stores") have come into wide use because of their low latency on analytic workloads. Column stores use a number of techniques to produce these dramatic performance techniques, including the ability to perform operations directly on compressed data.In this paper, we describe how the Tableau Data Engine (an internally devel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 8 publications
0
10
0
Order By: Relevance
“…This section gives a quick overview of one specific technique and is largely a summary of Sect. 5.2 from [2].…”
Section: Leverage Encoding For Query Executionmentioning
confidence: 97%
See 2 more Smart Citations
“…This section gives a quick overview of one specific technique and is largely a summary of Sect. 5.2 from [2].…”
Section: Leverage Encoding For Query Executionmentioning
confidence: 97%
“…It has been described in [1] and [2]. Most features described in the above papers have been shipped before Tableau 9.0, except for the new performance improvements covered in Sect.…”
Section: Tableau Data Enginementioning
confidence: 99%
See 1 more Smart Citation
“…Several systems follow this approach. Microsoft PowerBI [67] using Di-rectQuery [26] and Polaris/Tableau [90,98,99] provide plug-ins to many analytics engines; as discussed in [17], the users of such systems have to carefully avoid many queries that cannot be answered efficiently. IBM BigSheets [12] computes interactively only over a subset of the data; once the user settles on a query, it is actually run in batch mode using Spark.…”
Section: Related Workmentioning
confidence: 99%
“…During inter-query parallelization, to maximize multi-core utilization multiple such queries are executed concurrently. On the other hand, most systems such as MonetDB, Vectorwise, Tableau, and SQL Server [7,30] use intra-query parallelization, using the exchange operator [16], where a single query executes on multiple cores. We use the following setup to understand which technique performs better.…”
Section: Inter-query Vs Intra-querymentioning
confidence: 99%