1991
DOI: 10.1007/3-540-54132-2_49
|View full text |Cite
|
Sign up to set email alerts
|

An overview of parallel strategies for transitive closure on algebraic machines

Abstract: An important feature of database technology of the nineties is the use of distributed computation for speeding up the execution of complex queries. Today, the use of parallelism is tested in several experimental database architectures and a few commercial systems for conventional select-project-join queries. In particular, hash-based fragmentation is used to distribute data to disks under the control of different processors, in multi-processor architectures without shared memory, in order to perform selections… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

1992
1992
2020
2020

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 22 publications
0
6
0
Order By: Relevance
“…However, there has not been much work yet on investigating the tradeoffs using newer SQL-on-Hadoop solutions. We believe that, for our scenario, a semi-naive evaluation [8] is the better choice as it distributes the workload over more rounds and produce less derivations on graphs with cycles [3]. In contrast, a smart TC algorithm based on a nonlinear (recursivedoubling) execution [12] uses logarithmic, rather than linear, number of rounds but with much higher costs (with regard to the data-volume) per round [3].…”
Section: Query Compilermentioning
confidence: 98%
See 1 more Smart Citation
“…However, there has not been much work yet on investigating the tradeoffs using newer SQL-on-Hadoop solutions. We believe that, for our scenario, a semi-naive evaluation [8] is the better choice as it distributes the workload over more rounds and produce less derivations on graphs with cycles [3]. In contrast, a smart TC algorithm based on a nonlinear (recursivedoubling) execution [12] uses logarithmic, rather than linear, number of rounds but with much higher costs (with regard to the data-volume) per round [3].…”
Section: Query Compilermentioning
confidence: 98%
“…Fortunately, we can reduce such an expression to the problem of calculating the transitive closure (TC), which is a well-studied research field [8,12]. There is an ongoing debate whether the so-called semi-naive or smart TC algorithm is superior in distributed environments like MapReduce [2,20].…”
Section: Query Compilermentioning
confidence: 99%
“…At this point, we estimate that the maximum number of iterations has been reached and that the iteration terminates. This estimation relies on several assumptions, that are inspired by the so-called semi-naïve evaluation of transitive closures found in the literature [1,5,7,9]. In particular, we assume that only the new results generated by an iteration are used for the next iteration and that the number of tuples reduces until a maximum number of iterations N is reached.…”
Section: Fixpoint Operatormentioning
confidence: 99%
“…Recursive queries expresses a category of complex queries that involve iterative application of a function or operation until some condition is satisfied -known as the fixpoint. A variety of studies has been conducted on this class of queries including [5,9,11] and more recently [7,10,14]. One of the most difficult tasks in estimating the cost of a recursive query is determining the number of iterative steps needed for the iteration to converge.…”
Section: Introductionmentioning
confidence: 99%
“…Although in the context of PRISMA distributed transitive closure algorithms are very interesting, we will not go into this now. A good overview of parallel strategies for the transitive closure operation may be found in [8,9].…”
Section: Transitive Closurementioning
confidence: 99%