2017
DOI: 10.1007/978-3-319-72401-0_8
JCC-H: Adding Join Crossing Correlations with Skew to TPC-H

Abstract: We introduce JCC-H, a drop-in replacement for the data and query generator of TPC-H, that introduces Join-Crossing-Correlations (JCC) and skew into its dataset and query workload. These correlations are carefully designed such that the filter predicates on table columns in the existing TPC-H queries can now affect the value, frequency, and join-fan-out distributions experienced by operators in the query plan. The query generator of JCC-H is able to generate parameter bindings for t…
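The effect described in the abstract, a filter predicate on one table changing the join fan-out seen by a downstream operator, can be illustrated with a minimal sketch. The toy tables, the year values, and the `join_fan_out` helper below are hypothetical and are not taken from the JCC-H generator; the only detail borrowed from the citing literature is the idea of a handful of "populous" orders with very many line items.

```python
from collections import Counter

# Hypothetical toy data, NOT produced by the JCC-H generator.
# Each order is (order_key, order_year). Assumption for illustration:
# the five orders from 1992 are "populous", all other orders are ordinary.
orders = [(key, 1992 if key < 5 else 1993) for key in range(100)]

# Build the lineitem side as a flat list of foreign keys into orders.
lineitems = []
for key, year in orders:
    fan_out = 1000 if year == 1992 else 4  # assumed skew, chosen for clarity
    lineitems.extend([key] * fan_out)


def join_fan_out(year):
    """Average number of lineitem rows joined per order passing the filter."""
    selected = {key for key, order_year in orders if order_year == year}
    matches = Counter(key for key in lineitems if key in selected)
    return sum(matches.values()) / len(selected)


# The same join sees very different fan-out depending on the filter value.
print(join_fan_out(1992))  # 1000.0
print(join_fan_out(1993))  # 4.0
```

In a uniform dataset both filter values would yield the same average fan-out; here the filter column is correlated with the join's fan-out, so an optimizer that assumes independence would misestimate one of the two cases.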

Citations: cited by 10 publications (3 citation statements)
References: 8 publications
“…In the Join Order Benchmark [22], the RJ performed worse because it is string-processing heavy. JCC-H [8] provides a more realistic drop-in replacement for TPC-H with skew. It puts even more pressure on the radix join.…”
Section: Discussion (mentioning)
confidence: 99%
“…The JCC-H [9] benchmark demonstrates these problems, where there are five populous orders that have very many lineitems. When we groupjoin these orders with the lineitem table, the index based execution is essentially single-threaded, while the memoizing and eager right execution execute on all available cores.…”
Section: Using Indexes For Groupjoins (mentioning)
confidence: 99%
“…It has been noted that synthetic benchmarks like TPC-H do not capture all relevant aspects of real workloads [6], [9]. Recently, a workload study was published [31] based on the Tableau Public Business Intelligence (BI) free cloud service.…”
Section: B Public BI Benchmark (mentioning)
confidence: 99%