2013 International Conference on Advanced Technologies for Communications (ATC 2013) 2013
DOI: 10.1109/atc.2013.6698140
|View full text |Cite
|
Sign up to set email alerts
|

Heterogeneous hardware accelerator architecture for streaming image processing

Abstract: This paper proposes a heterogeneous hardware accelerator architecture to support streaming image processing. Each image in a data-set is pre-processed on a host processor and sent to hardware kernels. The host processor and the hardware kernels process a stream of images in parallel. The Convey hybrid computing system is used to develop our proposed architecture. We use the Canny edge detection algorithm as our case study. The data-set used for our experiment contains 7200 images. Experimental results show tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2015
2015

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…The presented system is capable of processing four independent data streams in parallel. A very similar project but intended for images is presented in [14]. The authors of this project keep the data-set of images on their hard drive and send them to the hardware accelerators implemented in FPGA for further processing.…”
Section: Fpgas For Hardware Acceleration Of Image and Video Processingmentioning
confidence: 99%
“…The presented system is capable of processing four independent data streams in parallel. A very similar project but intended for images is presented in [14]. The authors of this project keep the data-set of images on their hard drive and send them to the hardware accelerators implemented in FPGA for further processing.…”
Section: Fpgas For Hardware Acceleration Of Image and Video Processingmentioning
confidence: 99%
“…Phases 3,4,6, and 7 in the NoC-based interconnect system are shorter than in the baseline system due to data movement through the NoC. While all K1 out-put (D H 1(out ) and D K 1(out ) ) is copied back to the main memory in the baseline system in Phase 3, only part of this output (D H 1(out ) ) is copied to the main memory in the NoC-based interconnect system because data output consumed by K2 and K3 (D K 1(out ) ) is transferred to K2 and K3 by the NoC in Phase 2 (parallel with K1 execution).…”
Section: Modeling Noc-based Interconnectmentioning
confidence: 99%