Abstract

The 3D High Efficiency Video Coding (3D-HEVC) standard aims to code 3D videos that usually contain multi-view texture videos and its corresponding depth information. It inherits the same quadtree prediction structure of HEVC to code both texture videos and depth maps. Each coding unit (CU) allows recursively splitting into four equal sub-CUs. At each CU depth level, it enables 10 types of inter modes and 35 types of intra modes in inter frames. Furthermore, the inter-view prediction tools are applied to each view in the test model of 3D-HEVC (HTM), which uses variable size disparity-compensated prediction to exploit inter-view correlation within neighbor views. It also exploits redundancies between a texture video and its associated depth using inter-component coding tools. These achieve the highest coding efficiency to code 3D videos but require a very high computational complexity. In this paper, we propose a context-adaptive based fast CU processing algorithm to jointly optimize the most complex components of HTM including CU depth level decision, mode decision, motion estimation (ME) and disparity estimation (DE) processes. It is based on the hypothesis that the optimal CU depth level, prediction mode and motion vector of a CU are correlated with those from spatiotemporal, inter-view and inter-component neighboring CUs. We analyze the video content based on coding information from neighboring CUs and early predict each CU into one of five categories i.e., DE-omitted CU, ME-DE-omitted CU, SPLIT CU, Non-SPLIT CU and normal CU, and then each type of CU adaptively adopts different processing strategies. Experimental results show that the proposed algorithm saves 70% encoder runtime on average with only a 0.1% BD-rate increase on coded views and 0.8% BD-rate increase on synthesized views. Our algorithm outperforms the state-of-the-art algorithms in terms of coding time saving or with better RD performance.

Highlights

  • In the recently years, 3D video has undergone a rapid development from the release of 3D film and 3D video game to the emergence of 3D services such as stereoscopic 3DTV and free viewpoint television (FTV)

  • In order to test the performance of the proposed context-adaptive based coding unit (CU) processing algorithm (CACUP) for 3D extension of HEVC (3D-HEVC) including four components, early Non-SPLIT CU determination, early SPLIT CU determination, early disparity estimation (DE)-omitted CU determination and early motion estimation (ME)-DEomitted CU determination, it is implemented on the 3D-HEVC reference software, HTM 16.0

  • Coding efficiency is measured with PSNR and bitrate, and computational complexity is measured with the consumed encoding time

Read more

Summary

Introduction

3D video has undergone a rapid development from the release of 3D film and 3D video game to the emergence of 3D services such as stereoscopic 3DTV and free viewpoint television (FTV). The inter-view prediction is applied to each view in 3D-HEVC, which uses the variable-sized prediction techniques of HEVC to exploit the interview correlation between neighboring views It exploits redundancies between texture videos and its associated depth using inter-component coding tools. The cost for mode decision JRDO is specified by the following formula, JRDO 1⁄4 ðSSElumaþ ochroma  SSEchroma ÞþlÂB ð1Þ where B denotes the bitrate cost, which depends on each decision case, SSE is the sum of squared error between the current CU and the matching block, λ is the Lagrange multiplier and ωchroma is the weighing factor for chroma components Both CU depth level decision and prediction mode decision require heavy computational complexity for a CU.

Overview of related works on fast encoding methods
Observation and analysis
Early DE-omitted CU determination
Early ME-DE-omitted CU determination
Early SPLIT CU determination
Early Non-SPLIT CU determination
Flowchart of the proposed algorithm
Test conditions
Comparison with the state-of-the-art fast 3D-HEVC algorithms
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.