2015
DOI: 10.5281/zenodo.40565
|View full text |Cite
|
Sign up to set email alerts
|

Scikit-Cuda 0.5.1

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2018
2018

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…The original CPU code was written in Python for easy integration with other data acquisition, analysis, and visualization tools provided at NSLS-II. Therefore, in our port to NVIDIA GPUs we use PyCUDA (wrapper for CUDA API) [19], scikit-cuda (for cuFFT and other CUDA libraries) [20] and MPI for Python (mpi4py) [21]- [23] to accelerate the existing Python code. In addition, most of the computation are rewritten as CUDA C kernels, which are called through the PyCUDA binding for the best performance.…”
Section: Detailsmentioning
confidence: 99%
“…The original CPU code was written in Python for easy integration with other data acquisition, analysis, and visualization tools provided at NSLS-II. Therefore, in our port to NVIDIA GPUs we use PyCUDA (wrapper for CUDA API) [19], scikit-cuda (for cuFFT and other CUDA libraries) [20] and MPI for Python (mpi4py) [21]- [23] to accelerate the existing Python code. In addition, most of the computation are rewritten as CUDA C kernels, which are called through the PyCUDA binding for the best performance.…”
Section: Detailsmentioning
confidence: 99%
“…All matrix operations are conducted via the Numpy package (compiled against efficient matrix libraries). For the GPU implementation, we make use of the scikit-cuda package [10] and of custom CUDA kernels, called from Python via the PyCuda package [13].…”
Section: Setupmentioning
confidence: 99%
“…In figs. 8 and 9, the performance of Bifrost is compared over various parameter settings with a serial pipeline implemented with the package scikit-cuda 20 (Givon et al, 2015). Both scikit-cuda and Bifrost have Python frontends, wrapping a cuFFT back end, meaning that differences in performance should measure the speedup due to pipeline parallelism, or slowness due to Python overhead.…”
Section: Performancementioning
confidence: 99%