Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding

Hany Said,Claude C Chibelushi,Mansour Moniri

doi:10.1186/s13640-017-0164-7

Abstract

The use of mixed spatial resolutions in multi-view video coding is a promising approach for coding videos efficiently at low bitrates. It can achieve a perceived quality, which is close to the view with the highest quality, according to the suppression theory of binocular vision. The aim of the work reported in this paper is to develop a new multi-view video coding technique suitable for low bitrate applications in terms of coding efficiency, computational and memory complexity, when coding videos, which contain either a single or multiple scenes. The paper proposes a new prediction architecture that addresses deficiencies of prediction architectures for multi-view video coding based on H.264/AVC. The prediction architectures which are used in mixed spatial-resolution multi-view video coding (MSR-MVC) are afflicted with significant computational complexity and require significant memory size, with regards to coding time and to the minimum number of reference frames. The architecture proposed herein is based on a set of investigations, which explore the effect of different inter-view prediction directions on the coding efficiency of multi-view video coding, conduct a comparative study of different decimation and interpolation methods, in addition to analyzing block matching statistics. The proposed prediction architecture has been integrated with an adaptive reference frame ordering algorithm, to provide an efficient coding solution for multi-view videos with hard scene changes. The paper includes a comparative performance assessment of the proposed architecture against an extended architecture based on the 3D digital multimedia broadcast (3D-DMB) and the Hierarchical B-Picture (HBP) architecture, which are two most widely used architectures for MSR-MVC. The assessment experiments show that the proposed architecture needs less bitrate by on average 13.1 Kbps, less coding time by 14% and less memory consumption by 31.6%, compared to a corresponding codec, which deploys the extended 3D-DMB architecture when coding single-scene videos. Furthermore, the codec, which deploys the proposed architecture, accelerates coding by on average 57% and requires 52% less memory, compared to a corresponding codec, which uses the HBP architecture. On the other hand, multi-view video coding which uses the proposed architecture needs more bitrate by on average 24.9 Kbps compared to a corresponding codec that uses the HBP architecture. For coding a multi-view video which has hard scene changes, the proposed architecture yields less bitrate (by on average 28.7 to 35.4 Kbps), and accelerates coding time (by on average 64 and 33%), compared to the HBP and extended 3D-DMB architectures, respectively. The proposed architecture will thus be most beneficial in low bitrate applications, which require multi-view video coding for video content depicting hard scene changes.

Highlights

1.1 Context and related work The mixed spatial-resolution coding approach provides a better solution for multi-view video than the symmetric coding approach, at low bitrates
A new prediction architecture is proposed in Section 4, and it is integrated with the adaptive reference frame ordering algorithm
Since H.264/AVC enables inter-picture prediction at a level of quarter-pixels, each reference frame is represented by 16 samples which include: one integer sample; three Half-Pixel (H-Pel) samples; and twelve Quarter-Pixel (Q-Pel) samples

Summary

Introduction

1.1 Context and related work The mixed spatial-resolution coding approach provides a better solution for multi-view video than the symmetric coding approach, at low bitrates. It has been reported that mixed spatial-resolution stereoscopic video coding has less coding complexity and provides better rate-distortion than symmetric coding [1,2,3]. According to the suppression theory of binocular vision, the total perceived quality for mixed spatialresolution stereoscopic video is close to the view with the highest quality (the view with full spatial-resolution frames) [2, 6]. This is due to the high frequency components (which exist in the full spatial-resolution frames) which compensate the corresponding components in the lower spatial-resolution frames [7]. The mixed spatial-resolution approach provides better perceived quality than other coding approaches when coding multi-view videos at low bitrates [2, 9]

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Image and Video Processing	Publication Date: Feb 13, 2017
Citations: 4	License type: open-access

R Discovery Prime

R Discovery Prime

Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing

Lead the way for us

Similar Papers

MB-based Decoder Buffer Reduction for Multi-view Video Coding
Pei-Jun Lee ... Jin-Shun Huang
Journal of Signal Processing Systems | VOL. 80
Pei-Jun Lee, et. al.Pei-Jun Lee ... Jin-Shun Huang
25 Jan 2014
Journal of Signal Processing Systems | VOL. 80

Fine-Granular Motion Matching for Inter-View Motion Skip Mode in Multiview Video Coding
Haitao Yang ... Yilin Chang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 19
Haitao Yang, et. al. Haitao Yang ... Yilin Chang
01 Jun 2009
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 19

Multi-view video coding combined with adaptive prediction structure and fast mode selection
Xiaolan Wang
-
Xiaolan WangXiaolan Wang
02 Dec 2022
02 Dec 2022

Lossless fragile watermarking algorithm in compressed domain for multiview video coding
Wei Gao ... Mei Yu
Multimedia Tools and Applications | VOL. 78
Wei Gao, et. al.Wei Gao ... Mei Yu
29 Aug 2018
Multimedia Tools and Applications | VOL. 78

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Prediction architecture based on block matching statistics for mixed spatial-resolution multi-view video coding

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing