Abstract
We describe a quad-tree based variable size block matching (VSBM) motion estimation algorithm which is as computationally efficient as fixed size block matching (FSBM) and yet provides a better quality prediction. The 'match and merge' scheme allows the dimensions of blocks to adapt to local activity within the image, and the total number of blocks in any frame can be varied while still representing true motion fairly accurately. This permits adaptive it allocation between the representation of displacement and residual data, and the variation of the overall bit allowance on a frame-by-frame basis. The cost of coding the motion information from the VSBM technique is compared with the 2D motion vector prediction adopted by H.263 and MPEG-4 using FSBM with 16 by 16 macroblocks. 1D and 2D VSBM motion vector prediction strategies are described. The techniques are evaluated using two complete MPEG-4 test sequences. For similar quality prediction (same mean square error), 16 percent fewer bits are required to ode the motion vectors from the 'Foreman' sequence using the VSBM technique and a 2D predictor. The saving increases to 68 percent for the 'Container Ship' sequence in which there is less disparate motion. The cost of including the quad- tree description is included in both cases.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have