2017
DOI: 10.14778/3055540.3055545
|View full text |Cite
|
Sign up to set email alerts
|

Distributed join algorithms on thousands of cores

Abstract: Traditional database operators such as joins are relevant not only in the context of database engines but also as a building block in many computational and machine learning algorithms. With the advent of big data, there is an increasing demand for efficient join algorithms that can scale with the input data size and the available hardware resources.In this paper, we explore the implementation of distributed join algorithms in systems with several thousand cores connected by a low-latency network as used in hi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
40
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 66 publications
(41 citation statements)
references
References 29 publications
1
40
0
Order By: Relevance
“…The shuffling operator is the only operator that will transmit and receive data over the network. Slow networks can be a bottleneck for parallel database systems [33] and data shuffling has been shown to be a significant contributor to the end-to-end query response time [2,3,37].…”
Section: Data Shuffling In Parallel Database Systemsmentioning
confidence: 99%
See 2 more Smart Citations
“…The shuffling operator is the only operator that will transmit and receive data over the network. Slow networks can be a bottleneck for parallel database systems [33] and data shuffling has been shown to be a significant contributor to the end-to-end query response time [2,3,37].…”
Section: Data Shuffling In Parallel Database Systemsmentioning
confidence: 99%
“…Still, about 30% of the cycles are idle and would be devoted to other activities in a well-designed database system. Barthels et al [2] Figure 10(b) MESQ/SR vs. IPoIB with 16 nodes in the FDR cluster). As also seen in the repartition pattern, the MESQ/SR algorithm shows good scalability in the FDR cluster while the MQ algorithms degrade.…”
Section: Throughput When Scaling Outmentioning
confidence: 99%
See 1 more Smart Citation
“…For the case where the result is larger than a private cache, but smaller than the combined shared cache of all threads, Cieslewicz and Ross [11] show that SHAREDAGGREGATION may be a better solution than the other two, which uses uses a shared (lock-free) hash table, at least in the absence of skew. Similar techniques have been proposed for JOIN and SORT operators [4,5,6,7,9]. As we show in this paper, these techniques alone are not sufficient for reproducible floating-point numbers.…”
Section: Related Workmentioning
confidence: 73%
“…As a first solution for reproducible floating-point aggregation with GROUPBY, we propose a data type that can be used as drop-in replacement for intermediate aggregates of floatingpoint numbers in any state-of-the-art aggregation algorithm with little to no modification. 7 We base this type on the…”
Section: A Reproducible Floating-point Typementioning
confidence: 99%