Proceedings of the 38th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems 2019
DOI: 10.1145/3294052.3319698

Instance and Output Optimal Parallel Algorithms for Acyclic Joins

Abstract: Massively parallel join algorithms have received much attention in recent years, while most prior work has focused on worst-case optimal algorithms. However, the worst-case optimality of these join algorithms relies on hard instances having very large output sizes, which rarely appear in practice. A stronger notion of optimality is output-optimal, which requires an algorithm to be optimal within the class of all instances sharing the same input and output size. An even stronger optimality is instance-optimal, i.e.,…

Cited by 18 publications (11 citation statements)
References 46 publications
“…The fourth class comprises all the acyclic queries, which were recently solved by Hu [8] optimally. It is worth pointing out that Hu's algorithm subsumes an earlier algorithm of [9] which is worst-case optimal on a subclass of acyclic queries.…”
Section: Previous Results (mentioning)
confidence: 99%
“…In our vertex-centric join, the graph representation of the input database is never reshuffled, regardless of the query. The generalization of the parallel sort-join algorithm to any acyclic join is presented in [33], and has a total communication cost of $O(\mathrm{IN} + \sqrt{\mathrm{IN} \cdot \mathrm{OUT}})$. This outperforms our algorithm and parallel hash-join when the join output blows up to be larger than the input, but it is worse for selective joins and it is equivalent when the query involves only PK-FK joins.…”
Section: Comparison To Other Algorithms Reference [35] Describes a Ha... (mentioning)
confidence: 99%
“…On the theoretical front, we show that TAG-join's communication and computation complexities are competitive with those of the best-known parallel join algorithms [10,15,16,32,33,35] while avoiding the relation reshuffling these algorithms require…”
Section: Introduction (mentioning)
confidence: 99%
“…A good database scheme design would allow information to be retrieved easily and efficiently [10]. Acyclic databases are preferred for the desirable computational properties they enjoy: query optimization is easier than for cyclic databases, and they can be recognized in linear time [11][12][13][14][15][16][17].…”
Section: Introduction (mentioning)
confidence: 99%