2014 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS) 2014
DOI: 10.1109/apccas.2014.7032774
|View full text |Cite
|
Sign up to set email alerts
|

A highly parallel SAD architecture for motion estimation in HEVC encoder

Abstract: Abstract-The high computational cost of the motion estimation module in the new HEVC standard raises the need for efficient hardware architectures that can meet the real-time processing constraint. In addition, targeting HD and UHD resolutions increases the motion estimation processing cost beyond the capabilities of the currently existing architectures. This paper presents a highly parallel sum of absolute difference (SAD) architecture for motion estimation in HEVC encoder. The proposed architecture has 64 PU… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(13 citation statements)
references
References 12 publications
0
12
0
1
Order By: Relevance
“…For absolute difference calculation, one method is to detect the smaller operand in the absolute difference computation |CB−RB| and to subtract it from the larger operand [4], [5].The other method comprises of complimenting the smaller of the two numbers and then performing addition of two numbers followed by plus one to compute the absolute difference [6]. To compute the absolute difference for SAD, in [7] a novel architecture is optimized for realizing efficient absolute difference circuits in Virtex-5 FPGA devices which uses the 6-input look-up tables available within the chosen devices family to maximize speed performance and to minimize the amount of occupied resources.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…For absolute difference calculation, one method is to detect the smaller operand in the absolute difference computation |CB−RB| and to subtract it from the larger operand [4], [5].The other method comprises of complimenting the smaller of the two numbers and then performing addition of two numbers followed by plus one to compute the absolute difference [6]. To compute the absolute difference for SAD, in [7] a novel architecture is optimized for realizing efficient absolute difference circuits in Virtex-5 FPGA devices which uses the 6-input look-up tables available within the chosen devices family to maximize speed performance and to minimize the amount of occupied resources.…”
Section: Related Workmentioning
confidence: 99%
“…Hardware architecture for computing the absolute difference between corresponding pixels in current and reference video block is proposed. The method used for AD calculation is as used in [6] where adder and comparator form the basic component as shown in Figure 1.The 8-bit comparator compares two numbers and returns the 1's complement of the smaller number and the larger number as it is. Proposed Architecture: In this architecture:…”
Section: Proposed Architecturementioning
confidence: 99%
“…This architecture is generally utilised for low resolution video data along with less complex applications. In [9] a highly parallel SAD architecture for motion estimation in HEVC encoder is presented where there are sixty four processing units which are operating in parallel for computation of SAD values in ME unit. Here the architecture utilises separate two memory banks one for Reference CTB and other for Instantaneous CTB.…”
Section: Related Workmentioning
confidence: 99%
“…Its hardware implementation can be helpful for the video encoder. In the literature, many works are proposed to support the SAD architecture for several application domains, such as computer vision, like motion detection for image processing [6], on the system video surveillance based motion detection and recognition in [7], that it implements on an embedded board based on XC2V1000 FPGA and motion estimation for video compression standards [8,9,10,11,12,13,14]. All the proposed architectures aim to reach real-time processing for higher resolutions sequences with the highest possible operating frequency and to compute the maximum inter-prediction blocs whatever their sizes.…”
Section: Introductionmentioning
confidence: 99%
“…The synthesis shows an operating frequency of 110 MHz with 55346 LUTs, 19744 registers, and 148kB of BRAM. More recently, Medhat et al [14] proposed a parallel hardware SAD accelerator for the motion estimation, synthetized on a Xilinx Virtix-7 XC7VX550T FPGA. An operating frequency of 458 MHz has been reported with 39901 LUTs and 24957 registers.…”
Section: Introductionmentioning
confidence: 99%