1993
DOI: 10.1109/71.224220

Balanced parallel sort on hypercube multiprocessors

Abstract: A parallel sorting algorithm for sorting n elements evenly distributed over 2^d = p nodes of a d-dimensional hypercube is presented. The average running time of the algorithm is O((n log n)/p + p log^2 n). The algorithm maintains a perfect load balance in the nodes by determining the (kn/p)th elements (k = 1, ..., p-1) of the final sorted list in advance. These p-1 keys are used to partition the sorted sublists in each node to redistribute data to the nodes to be merged in parallel. The nodes finish the sort …
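
The partition-and-merge idea in the abstract can be illustrated with a small sequential simulation. This is only a sketch under stated assumptions, not the paper's implementation: the function name `balanced_sort` is hypothetical, the p nodes are modeled as plain Python lists, and the p-1 splitter keys are found with a sequential k-way merge as a stand-in for whatever distributed selection the paper uses. Keys are tagged with (node, index) so that duplicates still split into exactly n/p elements per node.

```python
import heapq
from bisect import bisect_right


def balanced_sort(blocks):
    """Simulate the scheme on p equal-sized, non-empty blocks (one per node)."""
    p = len(blocks)
    m = len(blocks[0])  # n/p elements per node
    assert m > 0 and all(len(b) == m for b in blocks)

    # Step 1: each node sorts its own block. Tagging each key with
    # (node, index) makes all items distinct, so ties break deterministically.
    local = [
        sorted((key, node, i) for i, key in enumerate(block))
        for node, block in enumerate(blocks)
    ]

    # Step 2 (stand-in, assumption): obtain the (k*n/p)th elements of the
    # final order, k = 1..p-1, from a sequential merge of the sorted blocks.
    merged = list(heapq.merge(*local))
    splitters = [merged[k * m - 1] for k in range(1, p)]

    # Step 3: every node cuts its sorted sublist at the splitters; node
    # `dest` receives one piece from each node and merges the pieces.
    result = []
    for dest in range(p):
        pieces = []
        for sub in local:
            lo = bisect_right(sub, splitters[dest - 1]) if dest > 0 else 0
            hi = bisect_right(sub, splitters[dest]) if dest < p - 1 else len(sub)
            pieces.append(sub[lo:hi])
        result.append([key for key, _, _ in heapq.merge(*pieces)])
    return result


if __name__ == "__main__":
    nodes = [[9, 1, 5, 5], [5, 5, 5, 5], [0, 7, 3, 5]]
    print(balanced_sort(nodes))  # [[0, 1, 3, 5], [5, 5, 5, 5], [5, 5, 7, 9]]
```

Even with many duplicate keys, each simulated node ends with exactly n/p elements, which is the load-balance property the abstract describes. The communication pattern on the hypercube and the parallel selection step are not modeled here.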

Cited by 21 publications (10 citation statements)
References: 12 publications

“…where γ d is defined as in (3), Γ(v, d) has to fulfill the constraint specified by (1), and pa = V e a , C e a is a valid design transformation of EPN e.…”
Section: Optimization Problem (mentioning)
confidence: 99%
“…Replicating processes increases data parallelism and structural unfolding of a process increases the task and pipeline parallelism by hierarchically instantiating more processes in the process network. Furthermore, as recursive algorithms are commonly used in mathematical [1] and multimedia [2] applications, we study the recursive description of processes as a structural unfolding method.…”
Section: Introduction (mentioning)
confidence: 99%
“…Under such a policy, compulsory misses (as well as coherence misses) become significant. There is extensive research in the development of algorithms to minimize the amount of communication needed on traditional architectures and in the development of new architectures to either reduce the communication or overlap several communication activities [19, 26-34]. However, communication time remains a significant fraction of total run time due to the large diameter of the topologies and the limited connectivity between processors.…”
Section: Some-bus Architecture Enhancements (mentioning)
confidence: 99%
“…Most parallel sorting algorithms have been developed either in the context of PRAM models, e.g. [9-11], or network models [12-14]. These algorithms typically assume a large number of processors (comparable to the number of data elements) and either neglect communication costs (PRAM algorithms) or rely on a specific machine structure (network-based algorithms).…”
Section: The Performance Of a Selection Of Sorting Algorithms (mentioning)
confidence: 99%