2000
DOI: 10.1117/1.482755
|View full text |Cite
|
Sign up to set email alerts
|

Mapping of two-dimensional convolution on very long instruction word media processors for real-time performance

Abstract: Programmable media processors have been emerging to meet the continuously increasing computational demand in complex digital media applications, such as HDTV and MPEG-4, at an affordable cost. These media processors provide the flexibility to implement various image computing algorithms along with high performance, unlike the hardwired approach that has provided high performance for a particular algorithm, but lacks flexibility. However, to achieve high performance on these media processors, a careful and some… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2001
2001
2010
2010

Publication Types

Select...
4
3

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(20 citation statements)
references
References 11 publications
0
20
0
Order By: Relevance
“…While the core processor computes on the data in the current cache block, the DMA controller stores the previously computed block into external memory and brings the next block to be processed from external memory to on-chip memory, thus overlapping the time required to move the data with the computing time. Details of using the DMA controller can be found in [14]. This example illustrates the computational advantage of using boxcar kernels compared to using generalized kernels, especially for the case requiring large kernels/low cutoff frequencies.…”
Section: Emphasis Gain Controlmentioning
confidence: 96%
See 2 more Smart Citations
“…While the core processor computes on the data in the current cache block, the DMA controller stores the previously computed block into external memory and brings the next block to be processed from external memory to on-chip memory, thus overlapping the time required to move the data with the computing time. Details of using the DMA controller can be found in [14]. This example illustrates the computational advantage of using boxcar kernels compared to using generalized kernels, especially for the case requiring large kernels/low cutoff frequencies.…”
Section: Emphasis Gain Controlmentioning
confidence: 96%
“…For each point in the output image, a total of M 2 -1 additions must be performed. An efficient moving average method [14,19] …”
Section: Two-dimensional Boxcar Convolutionmentioning
confidence: 99%
See 1 more Smart Citation
“…The IGALU also supports powerful instructions for filtering operations used extensively in image and video applications. For example, a single inner-product instruction can perform eight 16-bit multiplications in parallel, summing the results into a 32-bit output (Managuli et al, 2000). The two clusters together are capable of executing 4 different instructions (e.g., 2 on IALUs and 2 on IGALUs) in each clock cycle.…”
Section: Implementation On a Mediaprocessormentioning
confidence: 99%
“…Data flow programming in mediaprocessors is typically handled in two ways: programmable DMA (direct memory access) engines and cache prefetching [7]. Implementing data flow using a DMA engine involves software development around the core computation tight loop.…”
Section: Universal Data Flow Code Generatormentioning
confidence: 99%