Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data 2011
DOI: 10.1145/1989323.1989444
|View full text |Cite
|
Sign up to set email alerts
|

Automated partitioning design in parallel database systems

Abstract: In recent years, Massively Parallel Processors (MPPs) have gained ground enabling vast amounts of data processing. In such environments, data is partitioned across multiple compute nodes, which results in dramatic performance improvements during parallel query execution. To evaluate certain relational operators in a query correctly, data sometimes needs to be re-partitioned (i.e., moved) across compute nodes. Since data movement operations are much more expensive than relational operations, it is crucial to de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
65
0
5

Year Published

2012
2012
2021
2021

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 94 publications
(72 citation statements)
references
References 19 publications
2
65
0
5
Order By: Relevance
“…5, the search tree's levels are split into two sections corresponding to the two search phases. In the first phase, Horticulture explores the tables' candidate attributes using a branch-and-bound search [49,32]. Once all of the relaxed tables are assigned an attribute in D relax , Horticulture then performs a brute-force search in the second phase to select the stored procedures' routing parameters.…”
Section: Local Searchmentioning
confidence: 99%
See 3 more Smart Citations
“…5, the search tree's levels are split into two sections corresponding to the two search phases. In the first phase, Horticulture explores the tables' candidate attributes using a branch-and-bound search [49,32]. Once all of the relaxed tables are assigned an attribute in D relax , Horticulture then performs a brute-force search in the second phase to select the stored procedures' routing parameters.…”
Section: Local Searchmentioning
confidence: 99%
“…Our cost model must also calculate estimates for designs where not all of the tables and procedures have been assigned an attribute yet [32]. This allows Horticulture to determine whether an incomplete design has a greater cost than the current best design, and thus allows it to skip exploring the remainder of the search tree below its current location.…”
Section: Incomplete Designsmentioning
confidence: 99%
See 2 more Smart Citations
“…Automatic database partitioning for distributed databases has recently received further attention. In [14], data is partitioned automatically to optimize the execution of MPP systems. As a possible alternative they only consider hash-based partitioning over a single column.…”
Section: Effect Of Imbalance Factor and Data Correlationmentioning
confidence: 99%