1993
DOI: 10.1006/jpdc.1993.1008
|View full text |Cite
|
Sign up to set email alerts
|

Large-Scale Sorting in Uniform Memory Hierarchies

Abstract: Abstract. We present several e cient algorithms for sorting on the uniform memory hierarchy UMH, introduced by Alpern, Carter, and Feig, and its parallelization P-UMH. We give optimal and nearly-optimal algorithms for a wide range of bandwidth degradations, including a parsimonious algorithm for constant bandwidth. We also develop optimal sorting algorithms for all bandwidths for other versions of UMH and P-UMH, including natural restrictions we i n troduce called RUMH and P-RUMH, which more closely correspond… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

1994
1994
2008
2008

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 26 publications
(11 citation statements)
references
References 6 publications
0
11
0
Order By: Relevance
“…Then it can be applied separately to each cache level, noting that the data transfer in the higher levels do not contribute for any given level. ✷ These lower bounds are in the same spirit as those of Vitter and Nodine [32] (for the S-UMH model) and Savage [28], that is, the lower bounds do not capture the simultaneous interaction of the different levels.…”
Section: Theorem 55 the Lower Bound For Sorting In The Restricted Mumentioning
confidence: 91%
See 1 more Smart Citation
“…Then it can be applied separately to each cache level, noting that the data transfer in the higher levels do not contribute for any given level. ✷ These lower bounds are in the same spirit as those of Vitter and Nodine [32] (for the S-UMH model) and Savage [28], that is, the lower bounds do not capture the simultaneous interaction of the different levels.…”
Section: Theorem 55 the Lower Bound For Sorting In The Restricted Mumentioning
confidence: 91%
“…Bilardi and Peserico [8] investigate further the complexity of designing algorithms without the knowledge architectural parameters. 2 Other attempts were directed towards extracting better performance by parallel memory hierarchies [32,33,14], where several blocks could be transferred simultaneously.…”
Section: Related Workmentioning
confidence: 99%
“…Nodine and Vitter [22] describe several efficient sorting algorithms for the parallel disk model. Interestingly, Nodine and Vitter [21] also consider a multiprocessor version of the parallel disk model, but not in a way that is appropriate for multicores, since they assume that the processors are interconnected via a PRAM-type network or share the entire internal memory (see, e.g., [28,29,30]). Assuming that processors share internal memory does not fit the current practice of multicore.…”
Section: Introductionmentioning
confidence: 99%
“…These include both methods to improve the rate of I/O delivery to uniprocessor systems by i n troducing parallelism into the I/O subsystem, and methods of improving the I/O performance of multiprocessors. At the highest level, new theoretical models of parallel I/O systems are being developed 1,33,25,32], allowing the study of many fundamental algorithms in terms of their I/O complexity. A t the next level, new language and compiler features are being developed to support I/O parallelism and optimizations, using data layout conversion 12] and compiler hints 29].…”
Section: Introductionmentioning
confidence: 99%