Leveraging compression in the Tableau data engine

Wesley, Richard; Terlecki, Paweł

doi:10.1145/2588555.2595639

Cited by 12 publications

(10 citation statements)

References 8 publications


“…This section gives a quick overview of one specific technique and is largely a summary of Sect. 5.2 from [2].…”

Section: Leverage Encoding For Query Execution (mentioning)

confidence: 97%

“…It has been described in [1] and [2]. Most features described in the above papers have been shipped before Tableau 9.0, except for the new performance improvements covered in Sect.…”

Section: Tableau Data Engine (mentioning)

confidence: 99%

“…Such techniques have been discussed in [2]. The implementation has now become part of the Tableau 9.0 release.…”

Section: Leverage Encoding For Query Execution (mentioning)

confidence: 99%


On Improving User Response Times in Tableau

Terlecki, Shaw, et al. 2015

Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data

Self Cite


The rapid increase in data volumes and complexity of applied analytical tasks poses a big challenge for visualization solutions. It is important to keep the experience highly interactive, so that users stay engaged and can perform insightful data exploration. Query processing usually dominates the cost of visualization generation. Therefore, in order to achieve acceptable response times, one needs to utilize backend capabilities to the fullest and apply techniques such as caching or prefetching. In this paper we discuss key data processing components in Tableau: the query processor, query caches, Tableau Data Engine [1, 2] and Data Server. Furthermore, we cover recent performance improvements related to the number and quality of remote queries, broader reuse of cached data, and application of inter- and intra-query parallelism.


“…Several systems follow this approach. Microsoft PowerBI [67] using DirectQuery [26] and Polaris/Tableau [90,98,99] provide plug-ins to many analytics engines; as discussed in [17], the users of such systems have to carefully avoid many queries that cannot be answered efficiently. IBM BigSheets [12] computes interactively only over a subset of the data; once the user settles on a query, it is actually run in batch mode using Spark.…”

Section: Related Work (mentioning)

confidence: 99%

Hillview

et al. 2019


Hillview is a distributed spreadsheet for browsing very large datasets that cannot be handled by a single machine. As a spreadsheet, Hillview provides a high degree of interactivity that permits data analysts to explore information quickly along many dimensions while switching visualizations on a whim. To provide the required responsiveness, Hillview introduces visualization sketches, or vizketches, as a simple idea to produce compact data visualizations. Vizketches combine algorithmic techniques for data summarization with computer graphics principles for efficient rendering. While simple, vizketches are effective at scaling the spreadsheet by parallelizing computation, reducing communication, providing progressive visualizations, and offering precise accuracy guarantees. Using Hillview running on eight servers, we can navigate and visualize datasets of tens of billions of rows and trillions of cells, much beyond the published capabilities of competing systems.


“…During inter-query parallelization, to maximize multi-core utilization multiple such queries are executed concurrently. On the other hand, most systems such as MonetDB, Vectorwise, Tableau, and SQL Server [7,30] use intra-query parallelization, using the exchange operator [16], where a single query executes on multiple cores. We use the following setup to understand which technique performs better.…”

Section: Inter-query Vs Intra-query (mentioning)

confidence: 99%

Multi-core column-store parallelization under concurrent workload

Gawade, Kersten, Simitsis 2016

Proceedings of the 12th International Workshop on Data Management on New Hardware


Columnar database systems, designed for optimal OLAP workload performance, strive for maximum multi-core utilization under concurrent query execution. However, a multi-core parallel plan generated for isolated execution leads to suboptimal performance during concurrent query execution. In this paper, we analyze the effects of concurrent workload resource contention on multi-core plans using three intra-query parallelization techniques: static, adaptive, and cost model parallelization. We focus on a plan-level comparison of selected TPC-H queries, using in-memory multi-core columnar systems. Excessive partitions in statically parallelized plans result in heavy L3 cache misses, leading to memory contention and degrading query performance severely. Overall, adaptive plans show more robustness, less scheduling overhead, and an average 50% execution time improvement compared to statically parallelized plans and cost model based plans.
