2000
DOI: 10.1007/3-540-44469-6_60
|View full text |Cite
|
Sign up to set email alerts
|

A Skew-Insensitive Algorithm for Join and Multi-join Operations on Shared Nothing Machines

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
27
0

Year Published

2005
2005
2009
2009

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 14 publications
(27 citation statements)
references
References 5 publications
0
27
0
Order By: Relevance
“…Our experience with the BSP cost model and the tests presented in our previous papers (Bamha and Hains, 1999;Bamha and Hains, 2000;Hassan and Bamha, 2008) prove the effectiveness of our approach compared to standard hash-join pipelined algorithms.…”
Section: Resultsmentioning
confidence: 55%
See 2 more Smart Citations
“…Our experience with the BSP cost model and the tests presented in our previous papers (Bamha and Hains, 1999;Bamha and Hains, 2000;Hassan and Bamha, 2008) prove the effectiveness of our approach compared to standard hash-join pipelined algorithms.…”
Section: Resultsmentioning
confidence: 55%
“…Data skew can have a disastrous effect on performance (Mourad et al, 1994;Hua and Lee, 1991;DeWitt et al, 1992;Bamha and Hains, 2000;Bamha and Hains, 1999) due to the high costs of communications and synchronizations in this architecture.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For this reason, parallel processing of these queries is highly recommended in order to obtain acceptable response time (Bamha, 2005). Research has shown that join, which is one of the most expensive operations in DBMS, is parallelizable with near-linear speed-up only in ideal cases (Bamha and Hains, 2000). However, data skew degrades the performance of parallel systems (Bamha and Hains, 1999;Bamha and Hains, 2000;Seetha and Yu, 1990;Hua and Lee, 1991;Wolf et al, 1994;.…”
Section: Introductionmentioning
confidence: 99%
“…Research has shown that join, which is one of the most expensive operations in DBMS, is parallelizable with near-linear speed-up only in ideal cases (Bamha and Hains, 2000). However, data skew degrades the performance of parallel systems (Bamha and Hains, 1999;Bamha and Hains, 2000;Seetha and Yu, 1990;Hua and Lee, 1991;Wolf et al, 1994;. Thus, effective parallel algorithms that evenly distribute the load among processors and minimizes the inter-site communication must be employed in parallel and distributed systems in order to obtain acceptable performance.…”
Section: Introductionmentioning
confidence: 99%