IEEE Workshop on Signal Processing Systems Design and Implementation, 2005.
DOI: 10.1109/sips.2005.1579884
|View full text |Cite
|
Sign up to set email alerts
|

H.264/AVC interpolation optimization

Abstract: This paper discusses the optimization of the H.264/AVC sub-pixel interpolation operation in the context of a software implementation on a subword parallel processor. Several known algorithmic and architectural optimization approaches are combined to achieve a low-cost interpolation implementation. The proposed interpolation scheme, which produces identical results with the reference software, requires no multiplications and 16-bit integer arithmetic is sufficient for the computation. The instruction set extens… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 8 publications
0
1
0
Order By: Relevance
“…The horizontal and vertical filtering with dot product instructions can generate eight samples at each iteration, and achieve the iteration interval of 8 cycles and 11 cycles, respectively. Although the use of dot product instructions seems less efficient than using shift and addition instructions [22,23], the dotproduct implementation shows better performances, 3.7 times faster for the horizontal filtering and 6.1 times faster for the vertical filtering because it utilizes the hardware multipliers.…”
Section: Vliw-simd and Program Optimizationmentioning
confidence: 99%
“…The horizontal and vertical filtering with dot product instructions can generate eight samples at each iteration, and achieve the iteration interval of 8 cycles and 11 cycles, respectively. Although the use of dot product instructions seems less efficient than using shift and addition instructions [22,23], the dotproduct implementation shows better performances, 3.7 times faster for the horizontal filtering and 6.1 times faster for the vertical filtering because it utilizes the hardware multipliers.…”
Section: Vliw-simd and Program Optimizationmentioning
confidence: 99%