2016
DOI: 10.1007/s11554-016-0619-6
|View full text |Cite
|
Sign up to set email alerts
|

Acceleration techniques and evaluation on multi-core CPU, GPU and FPGA for image processing and super-resolution

Abstract: Super-Resolution (SR) techniques constitute a key element in image applications, which need highresolution reconstruction while in the worst case only a single low-resolution observation is available. SR techniques involve computationally demanding processes and thus researchers are currently focusing on SR performance acceleration. Aiming at improving the SR performance, the current paper builds up on the characteristics of the L-SEABI Super-Resolution (SR) method to introduce parallelization techniques for G… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 32 publications
(16 citation statements)
references
References 43 publications
0
16
0
Order By: Relevance
“…Specifically, our stereo matcher concerns bidirectional disparity map generation with 7 × 7 Gauss-weighted cost aggregation, and its architecture is detailed in [72]. For one pair of 1120 × 1120 8-bit pixel images with 200 disparity levels, a four-engine parallelization on Virtex6 VLX240T requires 0.54 s. Therefore, the FPGA is 29x faster than Intel i5 (see [89] for detailed comparison) and approximately 7700x faster than space-grade LEON3; this factor was derived after extrapolation from smaller images due to memory limitations on LEON3. Such huge factors can be also derived for Harris detection and scale-invariant feature transform (SIFT) description.…”
Section: Field-programmable Gate Arraymentioning
confidence: 99%
See 1 more Smart Citation
“…Specifically, our stereo matcher concerns bidirectional disparity map generation with 7 × 7 Gauss-weighted cost aggregation, and its architecture is detailed in [72]. For one pair of 1120 × 1120 8-bit pixel images with 200 disparity levels, a four-engine parallelization on Virtex6 VLX240T requires 0.54 s. Therefore, the FPGA is 29x faster than Intel i5 (see [89] for detailed comparison) and approximately 7700x faster than space-grade LEON3; this factor was derived after extrapolation from smaller images due to memory limitations on LEON3. Such huge factors can be also derived for Harris detection and scale-invariant feature transform (SIFT) description.…”
Section: Field-programmable Gate Arraymentioning
confidence: 99%
“…Besides Harris, we also use image superresolution (SR) and stereo matching (disparity [72]) to compare desktop GPUs to FPGAs. We tested a variety of devices and performed in-depth optimization of the implemented benchmarks on both platforms; a detailed analysis has been published separately [89]. Here, we consider Nvidia GTX 670/960, whereas for the FPGA (Zynq7000 Artix), we consider both processing and communication time, in contrast to the study in [89] that focuses on processing.…”
Section: Graphics Processing Unitmentioning
confidence: 99%
“…Assuming that the motion of adjacent frames is uniformly varying, according to Equations (4) and (5), the dynamic description model A and the observation model H are given by…”
Section: Motion Trajectory Filtering Based On Kalman Filtermentioning
confidence: 99%
“…Through the image stabilization technology, the impact of the moving camera on the image can be eliminated or reduced, which greatly improves the quality of image sequences. In addition, with the development of large area scientific imaging arrays, as well as the high speed processing elements such as DSP (digital signal processing), FPGA (field programmable gate array), GPU (graphics processing unit) and CUDA (compute unified device architecture), real-time image processing technology for high resolution images has become a hot topic in recent years [4,5], while real-time electronic image stabilization technology as one of the significant image processing techniques has also been extensively studied.…”
Section: Introductionmentioning
confidence: 99%
“…In this context, the performance of multi-core CPUs, GPUs and FPGAs have been evaluated in multiple application domain like image processing [17] and computer graphics [18]. As opposed to this approach, HCS evaluate performance results in terms of cooperative work across computing units.…”
Section: Introductionmentioning
confidence: 99%