Abstract

We describe a fully scalable wavelet-based 2D+t (in-band) video coding architecture. We propose new coding tools specifically designed for this framework aimed at two goals: reduce the computational complexity at the encoder without sacrificing compression; improve the coding efficiency, especially at low bitrates. To this end, we focus our attention on motion estimation and motion vector encoding. We propose a fast motion estimation algorithm that works in the wavelet domain and exploits the geometrical properties of the wavelet subbands. We show that the computational complexity grows linearly with the size of the search window, yet approaching the performance of a full search strategy. We extend the proposed motion estimation algorithm to work with blocks of variable sizes, in order to better capture localmotion characteristics, thus improving in terms of rate-distortion behavior. Given this motion field representation, we propose a motion vector coding algorithm that allows to adaptively scale the motion bit budget according to the target bitrate, improving the coding efficiency at low bitrates. Finally, we show how to optimally scale the motion field when the sequence is decoded at reduced spatial resolution. Experimental results illustrate the advantages of each individual coding tool presented in this paper. Based on these simulations, we define the best configuration of coding parameters and we compare the proposed codec with MC-EZBC, a widely used reference codec implementing the t+2D framework.

Highlights

  • Today’s video streaming applications require codecs to provide a bitstream that can be flexibly adapted to the characteristics of the network and the receiving device

  • We propose the following: (i) in Section 2.1, a fast motion estimation algorithm that is meant to work in the wavelet domain (FIBME—fast in-band motion estimation), exploiting the geometrical properties of the wavelet subbands

  • We show that motion vector truncation, adopted in the reference implementation of the MC-EZBC codec [5], is not the optimal choice when the motion field resolution needs to be scaled

Read more

Summary

INTRODUCTION

Today’s video streaming applications require codecs to provide a bitstream that can be flexibly adapted to the characteristics of the network and the receiving device. A 2D+t scheme comes with the advantage of reducing the impact of blocking artifacts caused by the failure of block-based motion models This is because such artifacts are canceled out by the inverse DWT spatial transform, without the need to adopt some sort of deblocking filtering. As shown in [14, 15], 2D+t approaches naturally fit the spatial scalability requirements providing higher coding efficiency when the sequence is decoded at reduced spatial resolution This is due to the fact that with in-band motion compensation it is possible to limit the problem of drift that occurs when decoder does not have access to all the wavelet subbands used at the encoder side. 2D+t schemes naturally support multi-hypothesis motion compensation taking advantage of the redundancy of the ODWT [10]

Motivations and goals
Related works
Fast in-band motion estimation
Variable size block matching
Scalable coding of motion vectors
Motion vectors and spatial scalability
EXPERIMENTAL RESULTS
CONCLUSIONS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call