H.264/AVC interpolation optimization

Sihvo, T.; Niittylahti, J.

doi:10.1109/sips.2005.1579884

Cited by 4 publications

(1 citation statement)

References 8 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The horizontal and vertical filtering with dot product instructions can generate eight samples at each iteration, and achieve the iteration interval of 8 cycles and 11 cycles, respectively. Although the use of dot product instructions seems less efficient than using shift and addition instructions [22,23], the dotproduct implementation shows better performances, 3.7 times faster for the horizontal filtering and 6.1 times faster for the vertical filtering because it utilizes the hardware multipliers.…”

Section: Vliw-simd and Program Optimizationmentioning

confidence: 99%

Algorithm and Software Optimization of Variable Block Size Motion Estimation for H.264/AVC on a VLIW–SIMD DSP

Lee

Choi

Sung

2008

J Sign Process Syst Sign Image

View full text Add to dashboard Cite

We implemented the H.264/AVC variable block size motion estimation (VBSME) using a very long instruction word (VLIW)-single instruction multiple data (SIMD) digital signal processor (DSP). The SAD_Reuse method which has a regular structure is chosen for VBSME not only to remove redundant sum of absolute difference (SAD) operations but also to utilize the instruction level parallelism (ILP) and data level parallelism (DLP) of the architecture. A fast mode decision algorithm is developed to reduce the number of 'compare and update' operations and simplify the rate distortion optimization (RDO). The developed fast mode decision uses the difference of motion vectors and the maximum a posteriori (MAP) estimation of the rate-distortion costs. Several advanced software techniques that include software pipelining and packed-data processing are employed. Especially, memory access overhead reduction schemes including the multi-block processing and the inter-procedural scheduling are used for the software optimization. In order to reduce the 'write buffer full' in the quarter pixel ME, a 4 bit quantization scheme is developed, which increases the number of arithmetic operations but decreases the stall cycles very much. The implemented variable block size ME for H.264/ AVC requires an average of 9 M and 78 Mcycles per frame for QCIF and CIF size video sequences, respectively, in the TMS320C64x DSP architecture.

show abstract

Section: Vliw-simd and Program Optimizationmentioning

confidence: 99%

Algorithm and Software Optimization of Variable Block Size Motion Estimation for H.264/AVC on a VLIW–SIMD DSP

Lee

Choi

Sung

2008

J Sign Process Syst Sign Image

View full text Add to dashboard Cite

show abstract

An Efficient VLSI Architecture of Fractional Motion Estimation in H.264 for HDTV

Ruiz

Michell

2010

J Sign Process Syst

View full text Add to dashboard Cite

Fractional Motion Estimation (FME) in highdefinition H.264 presents a significant design challenge in terms of memory bandwidth, latency and area cost as there are various modes and complex mode decision flow, which require over 45% of the computation complexity in the H.264 encoding process. In this paper, a new highperformance VLSI architecture for Fractional Motion Estimation (FME) in H.264/AVC based on the full-search algorithm is presented. This architecture is made up of three different pipeline processors to establish a trade-off between processing time and hardware utilization. The computing scheme based on a 4-pixel interpolation unit with a 10-pixel input bandwidth is capable of processing a macroblock (MB) in 870 clock cycles. The final VLSI implementation only requires 11.4 k gates and 4.4kBytes of RAM in a standard 180 nm CMOS technology operating at 290 MHz. Our design generates the residual image and the best MVs and mode in a high throughput and low area cost architecture while achieving enough processing capacity for 1080HD (1920×1088@30fps) real-time video streams.

show abstract

Lossless interpolation optimization for H.264 decoder

Wang

2010

2010 IEEE Asia Pacific Conference on Circuits and Systems

View full text Add to dashboard Cite

H.264/AVC interpolation optimization

Cited by 4 publications

References 8 publications

Algorithm and Software Optimization of Variable Block Size Motion Estimation for H.264/AVC on a VLIW–SIMD DSP

Algorithm and Software Optimization of Variable Block Size Motion Estimation for H.264/AVC on a VLIW–SIMD DSP

An Efficient VLSI Architecture of Fractional Motion Estimation in H.264 for HDTV

Lossless interpolation optimization for H.264 decoder

Contact Info

Product

Resources

About