Abstract

Multiview video coding (MVC) is a recent extension of H.264/AVC, and it consumes huge encoding time to select the optimal macroblock (MB) mode, among different size candidate modes. As compared with the small-size mode (Inter16 × 8, Inter8 × 16, Inter8 × 8, Intra8 × 8, and Intra4 × 4), the large-size mode (Skip/Direct, Inter16 × 16, and Intra16 × 16) occupies most of the MB mode proportion with much less computational com- plexity. Thus, if the large-size mode could be early decided as the optimal MB mode, the complexity of mode decision could be effectively reduced. In this work, an early large-size mode decision algorithm is proposed based on the global correlation of rate-distortion (RD) costs between neighbor views and the local correlation of RD costs among candidate modes. Average RD costs of large-size and small-size MB modes in the neighbor view are employed as a global reference for the threshold of early decision. And RD costs of estimated modes are used to calculate the local adjustment for the threshold. Experimental results demonstrate that the proposed algorithm can significantly reduce the whole encoding time while maintaining an RD performance similar to that of the original MVC encoder. © The Authors. Published by SPIE under a Creative Commons Attribution 3.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI. (DOI: 10 .1117/1.JEI.23.1.013027)

Highlights

  • Multiview video is captured from a set of viewpoints, and it is useful in many multimedia applications, such as threedimensional (3-D) television, free viewpoint television, and glass-free portable 3-D display

  • The correlation of coding information between neighbor views and the RD costs of MB mode in Multiview video coding (MVC) are usually employed to arrive at a faster mode decision

  • The view-adaptive motion estimation and disparity estimation (VAMEDE),[13] which includes mode size decision, fast motion estimation, and selective disparity estimation, was implemented on JMVC for interview views, and it achieves 79.6% average time saving with 0.04-dB PSNR loss and 1.79% bit rate increment (0.10-dB Bjontegaard delta peak signal-to-noise ratio (BDPSNR) loss and 3.16% Bjontegaard delta bit rate (BDBR) increment)

Read more

Summary

Introduction

Multiview video is captured from a set of viewpoints, and it is useful in many multimedia applications, such as threedimensional (3-D) television, free viewpoint television, and glass-free portable 3-D display. These algorithms presented some effective optimization techniques, such as adaptive termination strategy,[7,8,9,10] candidate modes selection,[10,11,12,13] prediction direction selection,[12,13,14] and early Skip/Direct mode decision.[14,15,16,17,18] For the reduction of the whole complexity, Shen et al.[13] combined the candidate modes selection with the fast motion estimation and the prediction direction selection; Khattak et al.[19] provided a complete framework that includes mode decision and reference frame selection and fast motion/disparity estimation In these algorithms, the correlation of coding information between neighbor views and the RD costs of MB mode in MVC are usually employed to arrive at a faster mode decision.

Motivation and Analysis
Proposed Early Large-Size Mode Decision Algorithm
Experimental Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.