Global-local correlation-based early large-size mode decision for multiview video coding

Wei Zhu,Jiani Xie,Peng Chen,Yayu Zheng

doi:10.1117/1.jei.23.1.013027

Abstract

Multiview video coding (MVC) is a recent extension of H.264/AVC, and it consumes huge encoding time to select the optimal macroblock (MB) mode, among different size candidate modes. As compared with the small-size mode (Inter16 × 8, Inter8 × 16, Inter8 × 8, Intra8 × 8, and Intra4 × 4), the large-size mode (Skip/Direct, Inter16 × 16, and Intra16 × 16) occupies most of the MB mode proportion with much less computational com- plexity. Thus, if the large-size mode could be early decided as the optimal MB mode, the complexity of mode decision could be effectively reduced. In this work, an early large-size mode decision algorithm is proposed based on the global correlation of rate-distortion (RD) costs between neighbor views and the local correlation of RD costs among candidate modes. Average RD costs of large-size and small-size MB modes in the neighbor view are employed as a global reference for the threshold of early decision. And RD costs of estimated modes are used to calculate the local adjustment for the threshold. Experimental results demonstrate that the proposed algorithm can significantly reduce the whole encoding time while maintaining an RD performance similar to that of the original MVC encoder. © The Authors. Published by SPIE under a Creative Commons Attribution 3.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI. (DOI: 10 .1117/1.JEI.23.1.013027)

Highlights

Multiview video is captured from a set of viewpoints, and it is useful in many multimedia applications, such as threedimensional (3-D) television, free viewpoint television, and glass-free portable 3-D display
The correlation of coding information between neighbor views and the RD costs of MB mode in Multiview video coding (MVC) are usually employed to arrive at a faster mode decision
The view-adaptive motion estimation and disparity estimation (VAMEDE),[13] which includes mode size decision, fast motion estimation, and selective disparity estimation, was implemented on JMVC for interview views, and it achieves 79.6% average time saving with 0.04-dB PSNR loss and 1.79% bit rate increment (0.10-dB Bjontegaard delta peak signal-to-noise ratio (BDPSNR) loss and 3.16% Bjontegaard delta bit rate (BDBR) increment)

Summary

Introduction

Multiview video is captured from a set of viewpoints, and it is useful in many multimedia applications, such as threedimensional (3-D) television, free viewpoint television, and glass-free portable 3-D display. These algorithms presented some effective optimization techniques, such as adaptive termination strategy,[7,8,9,10] candidate modes selection,[10,11,12,13] prediction direction selection,[12,13,14] and early Skip/Direct mode decision.[14,15,16,17,18] For the reduction of the whole complexity, Shen et al.[13] combined the candidate modes selection with the fast motion estimation and the prediction direction selection; Khattak et al.[19] provided a complete framework that includes mode decision and reference frame selection and fast motion/disparity estimation In these algorithms, the correlation of coding information between neighbor views and the RD costs of MB mode in MVC are usually employed to arrive at a faster mode decision.

Motivation and Analysis

Proposed Early Large-Size Mode Decision Algorithm

Experimental Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Global-local correlation-based early large-size mode decision for multiview video coding

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Electronic Imaging

Lead the way for us

Journal: Journal of Electronic Imaging	Publication Date: Feb 19, 2014
License type: cc-by

Similar Papers

Efficient early direct mode decision for multi-view video coding
Fengsui Wang ... Sidan Du
Signal Processing: Image Communication | VOL. 28
Fengsui Wang, et. al.Fengsui Wang ... Sidan Du
22 May 2013
Signal Processing: Image Communication | VOL. 28

Fast macroblock encoding algorithm based on rate-distortion activity for multiview video coding
Wei Zhu ... Jie Feng
Signal Processing: Image Communication | VOL. 29
Wei Zhu, et. al.Wei Zhu ... Jie Feng
10 Jun 2014
Signal Processing: Image Communication | VOL. 29

Fast Inter Mode Decision Based on RD Costs and Frequencies of Modes
Wei Li ... Xueli Huang
International Journal of Distributed Sensor Networks | VOL. 5
Wei Li, et. al.Wei Li ... Xueli Huang
01 Jan 2009
International Journal of Distributed Sensor Networks | VOL. 5

Early DIRECT Mode Decision for MVC Using MB Mode Homogeneity and RD Cost Correlation
Yue Li ... Gaobo Yang
IEEE Transactions on Broadcasting | VOL. 62
Yue Li, et. al.Yue Li ... Gaobo Yang
01 Sep 2016
IEEE Transactions on Broadcasting | VOL. 62

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Global-local correlation-based early large-size mode decision for multiview video coding

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Electronic Imaging