Proceedings of the Fifteenth Annual ACM Symposium on Parallel Algorithms and Architectures 2003
DOI: 10.1145/777412.777435
|View full text |Cite
|
Sign up to set email alerts
|

Asynchronous parallel disk sorting

Abstract: We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and that guarantees almost perfect overlap between I/O and computation. Previous algorithms have either suboptimal I/O volume or cannot guarantee that I/O and computations can always be overlapped. We give an efficient implementation that can (at least) compete with the best practical implementations but gives additional performance guarantees. For the experiments we have configured a state of the art machine that can … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
1

Year Published

2005
2005
2008
2008

Publication Types

Select...
3
2
2

Relationship

3
4

Authors

Journals

citations
Cited by 36 publications
(29 citation statements)
references
References 25 publications
0
28
1
Order By: Relevance
“…The second kind of sorting algorithms on the PDM are based on R-way merging for some suitable value of R that minimizes the number of passes through the data for the given size of internal memory [1,6,11,16,18].…”
Section: Prior Algorithms and Our Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The second kind of sorting algorithms on the PDM are based on R-way merging for some suitable value of R that minimizes the number of passes through the data for the given size of internal memory [1,6,11,16,18].…”
Section: Prior Algorithms and Our Resultsmentioning
confidence: 99%
“…The RC scheduling resulted in optimal distributed sort (RCD) and optimal mergesort (RCM) via duality [13]. These have been shown to be very practical [11]. However, these algorithms have been shown to be optimal only in expectation and no high probability bounds have been derived.…”
Section: Prior Algorithms and Our Resultsmentioning
confidence: 99%
“…Our experimental platform has two 2.0 GHz Intel Xeon processors, one GByte of RAM, and we use four 80 GByte IBM 120GXP disks. Refer to [11] for a performance evaluation of this machine whose cost was 2500 Euro in July 2002. The following instances have been considered: Random2: Two concatenated copies of a Random string of length n/2.…”
Section: Methodsmentioning
confidence: 99%
“…Both sorters are highly efficient parallel disk implementations. The algorithm they implement guarantees close to optimal I/O volume and almost perfect overlapping between I/O and computation [16]. The performance of the sorters scales well.…”
Section: Algorithmsmentioning
confidence: 96%
“…The input of the sorter may be an object complying to Stxxl stream interface. As the STL-user layer sorter, the pipelined sorter is an implementation of parallel disk merge sort [16] that overlaps I/O and computation. The implementation of stream::sort relies on two classes that encapsulate the two phases of the algorithm: sorted run formation (class runs creator) and run merging (runs merger).…”
Section: Streaming Layermentioning
confidence: 99%