2011
DOI: 10.1109/tpds.2010.115
|View full text |Cite
|
Sign up to set email alerts
|

Design and Performance Evaluation of Image Processing Algorithms on GPUs

Abstract: Abstract-In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of image processing algorithms map readily to CUDA using multiview stereo matching, linear feature extraction, JPEG2000 image encod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
47
0
3

Year Published

2011
2011
2017
2017

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 124 publications
(50 citation statements)
references
References 28 publications
0
47
0
3
Order By: Relevance
“…When kernel functions run N times, they work in parallel on N channels. This property of kernel separates from familiar C functions [15], [16]. The primary parallel construct is a data-parallel, SPMD kernel function.…”
Section: Parallel Implementationmentioning
confidence: 99%
“…When kernel functions run N times, they work in parallel on N channels. This property of kernel separates from familiar C functions [15], [16]. The primary parallel construct is a data-parallel, SPMD kernel function.…”
Section: Parallel Implementationmentioning
confidence: 99%
“…Second, the performance and scalability of the proposed approach will be evaluated by performing a comparison between the CSX600-based implementation and the fastest host only version using OpenMP on a state-of-theart multiprocessor server system featuring a total of 8 system core. Similar experiments using OpenMP have been used in image processing algorithms on GPUs [15]. Third, the experiments will provide some initial evidence for the claim that the proposed AP-based ATC system implementation exhibits greater efficiency and a huge increase in the degree of predictability than the MIMD-based solution.…”
Section: Resultsmentioning
confidence: 85%
“…However, the particular case of windowed computation has not received much attention in an analytical framework. On the other hand there is a lot of work on implementing image and windowed computations on particular platforms (notably FPGAs and GPUs) [1,2,5,8,11,12]. To our knowledge, ours is the first work to analytically study windowed computations and their execution on a pipelined platform of computation.…”
Section: Introductionmentioning
confidence: 99%