Lifting-based invertible motion adaptive transform (LIMAT) framework for highly scalable video compression

Secker, Andrew; Taubman, David

doi:10.1109/tip.2003.819433

Cited by 213 publications

(146 citation statements)

References 19 publications

Supporting

Mentioning

146

Contrasting

Order By: Relevance

“…To evaluate the coding performance of the proposed encoder, we compare it with a MCTF approach [25] and with two different configuration of the H.264/AVC reference software JM15.1 [46]. In the firs configuratio of JM15.1 (H.264 simp ), the test conditions are set so that only similar tools to the ones implemented in our encoder are enabled.…”

Section: A Video Coding Resultsmentioning

confidence: 99%

“…Once the transform is defined we propose a coefficien reordering approach and an entropy coder, leading to a complete video encoder. On average, our proposed system achieves improvements of 1.24 dB with respect to a MCTF encoder [25] and 0.34 dB with respect to a simplifie encoder derived from H.264/AVC (reference software JM15.1 configure to use tools similar to those in the proposed encoder, i.e., 1 reference frame, no subpixel motion estimation, 16 × 16 inter and 4 × 4 intra modes), for a variety of standard QCIF and CIF video sequences. These improvements are more significan at high qualities, where they are in the range of 1 to 3 dBs with respect to the simplifie H.264/AVC video encoder, obtaining similar coding results in six out of twelve test sequences when comparing to JM15.1 configure allowing 5 reference frames, all the inter and intra modes available, and motion estimation similar to the proposed encoder (subpixel motion estimation disabled).…”

Section: Contributionsmentioning

confidence: 94%

“…In the latter, each frame is firs wavelet transformed in the spatial domain, followed by MCTF. Representative examples of MCTF implementations are [25], [26] and [27]. These approaches can be described as separable because spatial and temporal filterin are applied in separate steps.…”

Section: B Related Workmentioning

confidence: 99%

See 2 more Smart Citations

Directional Transforms for Video Coding Based on Lifting on Graphs

Martinez-Enriquez

Cid-Sueiro

Díaz-de-María

et al. 2018

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

Abstract-In this work we describe and optimize a general scheme based on lifting transforms on graphs for video coding. A graph is constructed to represent the video signal. Each pixel becomes a node in the graph and links between nodes represent similarity between them. Therefore, spatial neighbors and temporal motion-related pixels can be linked, while nonsimilar pixels (e.g., pixels across an edge) may not be. Then, a lifting-based transform, in which filterin operations are performed using linked nodes, is applied to this graph, leading to a 3-dimensional (spatio-temporal) directional transform which can be viewed as an extension of wavelet transforms for video. The design of the proposed scheme requires four main steps: (i) graph construction, (ii) graph splitting, (iii) filte design, and (iv) extension of the transform to different levels of decomposition. We focus on the optimization of these steps in order to obtain an effective transform for video coding. Furthermore, based on this scheme, we propose a coefficien reordering method and an entropy coder leading to a complete video encoder that achieves better coding performance than a motion compensated temporal filterin wavelet-based encoder and a simple encoder derived from H.264/AVC that makes use of similar tools as our proposed encoder (reference software JM15.1 configu ed to use 1 reference frame, no subpixel motion estimation, 16 × 16 inter and 4 × 4 intra modes).

show abstract

Section: A Video Coding Resultsmentioning

confidence: 99%

Section: Contributionsmentioning

confidence: 94%

See 1 more Smart Citation

Directional Transforms for Video Coding Based on Lifting on Graphs

Martinez-Enriquez

Cid-Sueiro

Díaz-de-María

et al. 2018

IEEE Trans. Circuits Syst. Video Technol.

View full text Add to dashboard Cite

show abstract

“…Well-known examples include bandelets [78], edge-adapted multiscale transform [17], wedgelets [32,101], wavelet footprints [35], best tree-based representations [43,85], directionlets [96], motion-adaptive transform for videos [81], adaptive directional lifting [13,26], and grouplets [66]. We omit further discussions on these adaptive signal representations and refer readers to the references cited above for more details.…”

Section: Other Multiscale Geometric Representationsmentioning

confidence: 99%

Multidimensional Filter Banks and Multiscale Geometric Representations

Minh

2011

FNT in Signal Processing

View full text Add to dashboard Cite

“…In fact, most recent interest in wavelet-based video coding has migrated away from the traditional hybrid MC-feedback architecture considered here in favor of motion-compensated temporal filtering (MCTF) in order to provide full fidelity, spatial, and temporal scalability. Recent MCTF-based video coders have employed the RDWT (e.g., [33,2,22]), while others have used meshes (e.g., [27,28]). Indeed, our recent work [31,30] has focused on combining uniform meshes and the RDWT within the MCTF framework and has produced a scalable coder with state-of-the-art rate-distortion performance.…”

Section: Introductionmentioning

confidence: 99%

Motion estimation and compensation in the redundant-wavelet domain using triangle meshes

Cui¹,

Wang²,

Fowler³

2006

Signal Processing: Image Communication

View full text Add to dashboard Cite

In this paper, a technique is presented that incorporates an irregular triangle mesh into wavelet-domain motion estimation and compensation using a shift-invariant redundant-wavelet transform. The main contribution of this work resides in a demonstration that triangle-mesh motion estimation and compensation can be deployed more effectively in the redundant-wavelet domain thanks to a simple correlation operator that is robust to the prediction residual, a noise-like signal that hinders spatial-domain gradient-based efforts to locate image edges which persist from frame to frame. In the proposed technique, motion compensation takes place through an affine transform mapping triangles from one frame to the next, while the motion-compensated residual is downsampled to a non-redundant form which is then coded using any wavelet-based still-image coder. Experimental results indicate that the proposed technique significantly outperforms not only the usual spatial-domain approach to mesh-based motion estimation and compensation, but also the emerging wavelet-based coders that employ traditional block-based structures, particularly for sequences with fast or complex motion. r

show abstract

Lifting-based invertible motion adaptive transform (LIMAT) framework for highly scalable video compression

Cited by 213 publications

References 19 publications

Directional Transforms for Video Coding Based on Lifting on Graphs

Directional Transforms for Video Coding Based on Lifting on Graphs

Multidimensional Filter Banks and Multiscale Geometric Representations

Motion estimation and compensation in the redundant-wavelet domain using triangle meshes

Contact Info

Product

Resources

About