Proceedings 20th IEEE International Parallel &Amp; Distributed Processing Symposium 2006
DOI: 10.1109/ipdps.2006.1639284
|View full text |Cite
|
Sign up to set email alerts
|

GPU-ABiSort: optimal parallel sorting on stream architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
37
0
1

Year Published

2008
2008
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 44 publications
(39 citation statements)
references
References 10 publications
1
37
0
1
Order By: Relevance
“…Kipfer et al [7,8] have shown an improved version of the bitonic sort as well as an odd-even merge sort. Greß et al [9] introduced an approach based on the adaptive bitonic sorting technique presented by Bilardi et al [10]. Govindaraju et al [11] implemented a sorting solution based on the periodic balanced sorting network method by Dowd et al [12] and one based on bitonic sort [13].…”
Section: Introductionmentioning
confidence: 99%
“…Kipfer et al [7,8] have shown an improved version of the bitonic sort as well as an odd-even merge sort. Greß et al [9] introduced an approach based on the adaptive bitonic sorting technique presented by Bilardi et al [10]. Govindaraju et al [11] implemented a sorting solution based on the periodic balanced sorting network method by Dowd et al [12] and one based on bitonic sort [13].…”
Section: Introductionmentioning
confidence: 99%
“…Gress and Zachmann [4] present results of up to 1M elements, and then report a 37% performance improvement on the GPUSort algorithm on a GeForce 6800 based system, but only about 5% on a Geforce 7800 system. Our algorithm performs more than twice as fast as the GPUSort algorithm for arrays of four million elements and more.…”
Section: Millisecondsmentioning
confidence: 99%
“…It is, however, possible to modify bitonic sort to perform in O(n log n). GPU-ABiSort by Greß and Zachmann [4] utilizes Adaptive Bitonic Sorting [5], where the key is to use a bitonic tree, when merging two bitonic sequences, to rearrange the data to obtain a linear number of comparisons for the merge, instead of the n log n comparisons required by the standard bitonic sort [6]. This lowers the total complexity of Adaptive Bitonic Sorting to n log n. Greß and Zachmann thereby report slightly faster timings than Govindaraju [2] for their tested 32-bit streams of up to 1M elements.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, degree of parallelism is reduced as higher levels are sorted and thus not fully utilizing parallel GPU architecture. An adaptive bitonic-scheme is proposed in [12]. Their technique sorts n values using p stream processors achieving optimum complexity of ( ) p n n log Ο .…”
Section: Related Workmentioning
confidence: 99%