Designing an Adaptive VM That Combines Vectorized and JIT Execution on Heterogeneous Hardware

Gubner, Tim

doi:10.1109/icde.2018.00215

Cited by 4 publications

(2 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…During query execution, the VM can start interpreting code fragments using highly efficient vectorized interpretation. Later, the VM can generate optimized code for expensive fragments, tailored specifically to the current workload and hardware [17].…”

Section: Discussionmentioning

confidence: 99%

Charting the design space of query execution using VOILA

Gubner

Boncz

2021

Proc. VLDB Endow.

Self Cite

View full text Add to dashboard Cite

Database architecture, while having been studied for four decades now, has delivered only a few designs with well-understood properties. These few are followed by most actual systems. Acquiring more knowledge about the design space is a very time-consuming processes that requires manually crafting prototypes with a low chance of generating material insight. We propose a framework that aims to accelerate this exploration process significantly. Our framework enables synthesizing many different engines from a description in a carefully designed domain-specific language (VOILA). We explain basic concepts and formally define the semantics of VOILA. We demonstrate VOILA's flexibility by presenting translation back-ends that allow the synthesis of state-of-the-art paradigms (data-centric compilation, vectorized execution, AVX-512), mutations and mixes thereof. We show-case VOILA's flexibility by exploring the query engine design space in an automated fashion. We generated thousands of query engines and report our findings. Queries generated by VOILA achieve similar performance as state-of-the-art hand-optimized implementations and are up to 35.5X faster than well-known systems.

show abstract

Section: Discussionmentioning

confidence: 99%

Charting the design space of query execution using VOILA

Gubner

Boncz

2021

Proc. VLDB Endow.

Self Cite

View full text Add to dashboard Cite

show abstract

“…As for the Fluid Co-Processing framework, we intend to mature our prototype into a system that can execute generic and concurrent queries and perform adaptive offloading to heterogeneous hardware [11].…”

Section: Discussionmentioning

confidence: 99%

Fluid Co-processing

Gubner

Tomé

Lang³

et al. 2019

Proceedings of the 15th International Workshop on Data Management on New Hardware

Self Cite

View full text Add to dashboard Cite

It has so far been unclear which data-intensive CPU tasks can be accelerated with GPUs, as GPUs are bottlenecked by the slow bus connection to the CPU and the limited size of GPU memories. In this paper we demonstrate a database workload where co-processing actually helps: accelerating large join pipelines where the join condition is selective, by pushing down a Bloom filter test for early pruning. GPUs are more powerful than CPUs for computing hash functions needed in Bloom filter tests, have a local memory with significantly more random-access bandwidth than the CPU, and since only keys (or extracts thereof) have to be moved to the GPU, data transfers over the bus are relatively small. Our microbenchmarks show that raw Bloom filter lookups are up to 6× faster on the GPU than on the CPU in case the Bloom filter is larger than the CPU cache. The next quest is for a database architecture that allows efficient CPU-GPU co-processing. We present a new heterogeneous query processing framework based on fluid coprocessing. In fluid co-processing, tasks of different sizes-that fit the device-are dynamically co-processed. Early results show that fluid co-processing consistently improves end-to-end CPU performance of early pruning in join queries thanks to the GPU, by factors up to 2-3×.

show abstract