2015
DOI: 10.1002/cpe.3611
|View full text |Cite
|
Sign up to set email alerts
|

CUDA‐quicksort: an improved GPU‐based implementation of quicksort

Abstract: Summary Sorting is a very important task in computer science and becomes a critical operation for programs making heavy use of sorting algorithms. General‐purpose computing has been successfully used on Graphics Processing Units (GPUs) to parallelize some sorting algorithms. Two GPU‐based implementations of the quicksort were presented in literature: the GPU‐quicksort, a compute‐unified device architecture (CUDA) iterative implementation, and the CUDA dynamic parallel (CDP) quicksort, a recursive implementatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(14 citation statements)
references
References 19 publications
0
14
0
Order By: Relevance
“…Clustering is performed by sorting the prefixes of the read sequences with our GPU-based CUDA-Quicksort [25]. As CUDA-Quicksort sorts numerical values, it is necessary to encode the prefixes of the read sequences.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Clustering is performed by sorting the prefixes of the read sequences with our GPU-based CUDA-Quicksort [25]. As CUDA-Quicksort sorts numerical values, it is necessary to encode the prefixes of the read sequences.…”
Section: Methodsmentioning
confidence: 99%
“…The comparative assessment has been made in the task of sorting items with long keys -characterized by 19 digits (i.e., the maximum number of digits used to represent the encoded read prefixes). Experiments, performed ensuring a uniform distribution on benchmark datasets (with varying size from 1M to 32M elements), show that CUDA-Quicksort outperforms Thrust Radix Sort with a speed-up ranging from 1.58x to 2.18x, depending on the dataset at hand [25]. …”
Section: Methodsmentioning
confidence: 99%
“…Quicksort [23] is based on a partitioning operation: Firstly, this algorithm divides a large array into two short sub-arrays: the lower elements and the higher elements. It is divided into different steps: 1.…”
Section: Quicksortmentioning
confidence: 99%
“…Manca et al . propose CUDA‐quicksort, an iterative GPU‐based implementation of the sorting algorithm. The quicksort that they propose is based on the GPU‐quicksort implementation, wherein the process has two major steps.…”
Section: Parallel Multi‐key Quicksortmentioning
confidence: 99%
“…When the size of input is very small and when no more quicksort can be applied, then a bi-tonic sort [31] is applied to get the final result. Manca et al [29] propose CUDA-quicksort, an iterative GPU-based implementation of the sorting algorithm. The quicksort that they propose is based on the GPU-quicksort implementation, wherein the process has two major steps.…”
Section: Cpu Parallel Multi-key Quicksortmentioning
confidence: 99%