Multi-view Coding Research Articles

The use of mixed spatial resolutions in multi-view video coding is a promising approach for coding videos efficiently at low bitrates. It can achieve a perceived quality, which is close to the view with the highest quality, according to the suppression theory of binocular vision. The aim of the work reported in this paper is to develop a new multi-view video coding technique suitable for low bitrate applications in terms of coding efficiency, computational and memory complexity, when coding videos, which contain either a single or multiple scenes. The paper proposes a new prediction architecture that addresses deficiencies of prediction architectures for multi-view video coding based on H.264/AVC. The prediction architectures which are used in mixed spatial-resolution multi-view video coding (MSR-MVC) are afflicted with significant computational complexity and require significant memory size, with regards to coding time and to the minimum number of reference frames. The architecture proposed herein is based on a set of investigations, which explore the effect of different inter-view prediction directions on the coding efficiency of multi-view video coding, conduct a comparative study of different decimation and interpolation methods, in addition to analyzing block matching statistics. The proposed prediction architecture has been integrated with an adaptive reference frame ordering algorithm, to provide an efficient coding solution for multi-view videos with hard scene changes. The paper includes a comparative performance assessment of the proposed architecture against an extended architecture based on the 3D digital multimedia broadcast (3D-DMB) and the Hierarchical B-Picture (HBP) architecture, which are two most widely used architectures for MSR-MVC. The assessment experiments show that the proposed architecture needs less bitrate by on average 13.1 Kbps, less coding time by 14% and less memory consumption by 31.6%, compared to a corresponding codec, which deploys the extended 3D-DMB architecture when coding single-scene videos. Furthermore, the codec, which deploys the proposed architecture, accelerates coding by on average 57% and requires 52% less memory, compared to a corresponding codec, which uses the HBP architecture. On the other hand, multi-view video coding which uses the proposed architecture needs more bitrate by on average 24.9 Kbps compared to a corresponding codec that uses the HBP architecture. For coding a multi-view video which has hard scene changes, the proposed architecture yields less bitrate (by on average 28.7 to 35.4 Kbps), and accelerates coding time (by on average 64 and 33%), compared to the HBP and extended 3D-DMB architectures, respectively. The proposed architecture will thus be most beneficial in low bitrate applications, which require multi-view video coding for video content depicting hard scene changes.

Read full abstract

Augmented reality, interactive navigation in 3D scenes, multiview video, and other emerging multimedia applications require large sets of images, hence larger data volumes and increased resources compared with traditional video services. The significant increase in the number of images in multiview systems leads to new challenging problems in data representation and data transmission to provide high quality of experience on resource-constrained environments. In order to reduce the size of the data, different multiview video compression strategies have been proposed recently. Most of them use the concept of reference or key views that are used to estimate other images when there is high correlation in the data set. In such coding schemes, the two following questions become fundamental: 1) how many reference views have to be chosen for keeping a good reconstruction quality under coding cost constraints? And 2) where to place these key views in the multiview data set? As these questions are largely overlooked in the literature, we study the reference view selection problem and propose an algorithm for the optimal selection of reference views in multiview coding systems. Based on a novel metric that measures the similarity between the views, we formulate an optimization problem for the positioning of the reference views, such that both the distortion of the view reconstruction and the coding rate cost are minimized. We solve this new problem with a shortest path algorithm that determines both the optimal number of reference views and their positions in the image set. We experimentally validate our solution in a practical multiview distributed coding system and in the standardized 3D-HEVC multiview coding scheme. We show that considering the 3D scene geometry in the reference view, positioning problem brings significant rate-distortion improvements and outperforms the traditional coding strategy that simply selects key frames based on the distance between cameras.

Read full abstract

Multi-view Coding Research Articles

Related Topics

Articles published on Multi-view Coding

Temporal–Spatial Symmetric Distributed Multi-View Video Coding Scheme

Efficient multiview video plus depth coding for 3D-HEVC based on complexity classification of the treeblock

Cross-layer optimized authentication and error control for wireless 3D medical video streaming over LTE

Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding

Improving view random access via increasing hierarchical levels for multi-view video coding

Adaptive Bit Allocation for 3D Video Coding

Early DIRECT Mode Decision for MVC Using MB Mode Homogeneity and RD Cost Correlation

Light Field Multi-View Video Coding With Two-Directional Parallel Inter-View Prediction.

Parallel Multiview Video Coding Exploiting Group of Pictures Level Parallelism

A Modified Multiview Video Streaming System Using 3-Tier Architecture

Hierarchical modulation for client-driven selective streaming of multi-view video over AWGN channels

Texture-Aware Depth Prediction in 3D Video Coding

Big video data for light-field-based 3D telemedicine

Enhanced view random access ability for multiview video coding

Adaptive early termination mode decision for 3D-HEVC using inter-view and spatio-temporal correlations

Error‐Resilient Multi‐view Video Coding Based on End‐to‐End Rate‐Distortion Optimization

Optimal reference view selection algorithm for low complexity disparity estimation

A Scalable Massively Parallel Motion and Disparity Estimation Scheme for Multiview Video Coding

Reference View Selection in DIBR-Based Multiview Coding.

Early DIRECT mode decision based on all‐zero block and rate distortion cost for multiview video coding

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-view Coding Research Articles

Related Topics

Articles published on Multi-view Coding

Temporal–Spatial Symmetric Distributed Multi-View Video Coding Scheme

Efficient multiview video plus depth coding for 3D-HEVC based on complexity classification of the treeblock

Cross-layer optimized authentication and error control for wireless 3D medical video streaming over LTE

Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding

Improving view random access via increasing hierarchical levels for multi-view video coding

Adaptive Bit Allocation for 3D Video Coding

Early DIRECT Mode Decision for MVC Using MB Mode Homogeneity and RD Cost Correlation

Light Field Multi-View Video Coding With Two-Directional Parallel Inter-View Prediction.

Parallel Multiview Video Coding Exploiting Group of Pictures Level Parallelism

A Modified Multiview Video Streaming System Using 3-Tier Architecture

Hierarchical modulation for client-driven selective streaming of multi-view video over AWGN channels

Texture-Aware Depth Prediction in 3D Video Coding

Big video data for light-field-based 3D telemedicine

Enhanced view random access ability for multiview video coding

Adaptive early termination mode decision for 3D-HEVC using inter-view and spatio-temporal correlations

Error‐Resilient Multi‐view Video Coding Based on End‐to‐End Rate‐Distortion Optimization

Optimal reference view selection algorithm for low complexity disparity estimation

A Scalable Massively Parallel Motion and Disparity Estimation Scheme for Multiview Video Coding

Reference View Selection in DIBR-Based Multiview Coding.

Early DIRECT mode decision based on all‐zero block and rate distortion cost for multiview video coding