Abstract
We propose a Stereoscopic Visual Attention- (SVA-) based regional bit allocation optimization for Multiview Video Coding (MVC) that exploits the visual redundancies of human perception. We first propose a novel SVA model in which multiple perceptual stimuli, including depth, motion, intensity, color, and orientation contrast, are used to simulate the visual attention mechanisms of the human visual system with stereoscopic perception. Then, a semantic region-of-interest (ROI) is extracted based on the saliency maps of SVA. Both objective and subjective evaluations of the extracted ROIs indicate that the proposed SVA-based ROI extraction scheme outperforms schemes that use only spatial and/or temporal visual attention cues. Finally, using the extracted SVA-based ROIs, a regional bit allocation optimization scheme is presented that allocates more bits to the SVA-based ROIs for high image quality and fewer bits to background regions for efficient compression. Experimental results on MVC show that the proposed regional bit allocation algorithm can achieve over % bit-rate saving while maintaining the subjective image quality. Meanwhile, the image quality of ROIs is improved by dB at the cost of image quality degradation in the background, to which viewers are less sensitive.
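The ROI extraction step described above can be illustrated with a minimal sketch. The abstract does not state how the perceptual stimuli are combined or how the ROI is delineated, so everything here is an assumption: the weighted linear fusion, the 16x16 macroblock granularity, the fixed threshold, and the hypothetical names `normalize`, `fuse_saliency`, and `roi_mask`.

```python
import numpy as np

def normalize(feature_map):
    """Scale a feature map to [0, 1]; a constant map becomes all zeros."""
    m = np.asarray(feature_map, dtype=np.float64)
    span = m.max() - m.min()
    if span == 0:
        return np.zeros_like(m)
    return (m - m.min()) / span

def fuse_saliency(feature_maps, weights):
    """Weighted linear fusion of normalized feature maps (assumed rule,
    not taken from the paper)."""
    total = float(sum(weights.values()))
    fused = sum(weights[k] * normalize(feature_maps[k]) for k in weights)
    return fused / total

def roi_mask(saliency, block=16, threshold=0.5):
    """Mark a block as ROI when its mean saliency reaches the threshold.
    Block size and threshold are illustrative assumptions."""
    h, w = saliency.shape
    rows, cols = h // block, w // block
    # Drop partial blocks at the border, then average inside each block.
    blocks = saliency[:rows * block, :cols * block].reshape(rows, block, cols, block)
    return blocks.mean(axis=(1, 3)) >= threshold

# Toy example: one salient 16x16 block in the top-left corner of a 32x32 frame.
depth = np.zeros((32, 32))
depth[:16, :16] = 1.0
motion = np.zeros((32, 32))
motion[:16, :16] = 2.0
saliency = fuse_saliency({"depth": depth, "motion": motion},
                         {"depth": 0.5, "motion": 0.5})
mask = roi_mask(saliency)
```

In this toy case only the top-left block is flagged as ROI. A real model would also include the intensity, color, and orientation contrast maps named in the abstract.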
Highlights
Three-Dimensional Video (3DV) provides Three-Dimensional (3D) depth impression and allows users to freely choose a view of a visual scene [1]
Regional bit allocation optimization experiments are performed to allocate reasonable amounts of bits between ROI and background regions, and the optimal ΔQP is determined
Multiview Video Coding (MVC) experiments are implemented to verify the efficiency of the Stereoscopic Visual Attention- (SVA-)based bit allocation optimization
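As a rough illustration of how a ΔQP can steer bits between ROI and background, the sketch below builds a per-block quantization parameter (QP) map. It assumes that background blocks are coded at the ROI QP plus ΔQP, and that QP is clipped to the 0 to 51 range of H.264/AVC, on which MVC is built. The function name `regional_qp_map` and the example values are hypothetical, not taken from the paper.

```python
import numpy as np

QP_MIN, QP_MAX = 0, 51  # QP range of H.264/AVC

def regional_qp_map(roi, base_qp, delta_qp):
    """Return a per-block QP map: base_qp for ROI blocks and
    base_qp + delta_qp for background blocks (assumed convention)."""
    roi = np.asarray(roi, dtype=bool)
    qp = np.where(roi, base_qp, base_qp + delta_qp)
    return np.clip(qp, QP_MIN, QP_MAX)

# Example: a 2x2 block grid with a single ROI block.
roi = np.array([[True, False],
                [False, False]])
qp_map = regional_qp_map(roi, base_qp=28, delta_qp=6)
# ROI block keeps QP 28; background blocks are quantized more coarsely at QP 34.
```

A larger ΔQP moves more bits away from the background, so the value has to be chosen so that the background degradation stays unobtrusive, which is what the experiments in the highlights are meant to determine.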
Summary
Three-Dimensional Video (3DV) provides Three-Dimensional (3D) depth impression and allows users to freely choose a view of a visual scene [1]. Ozbek and Tekalp proposed a bit allocation among views for scalable multiview video coding [20]. All these bit allocation schemes improve the average Peak Signal-to-Noise Ratio (PSNR) but do not take the regional selective properties of the Human Visual System (HVS) into account. Tang et al. proposed a bit allocation scheme for 2D video coding that is guided by visual sensitivity, considering motion and texture structures [24]. These bit allocation schemes were proposed for single-view video coding and cannot be directly applied to MVC because inter-view prediction is adopted in MVC. We propose a Stereoscopic Visual Attention- (SVA-) based regional bit allocation for improving MVC coding efficiency. Depending on the type of display device, for example, HDTV, stereoscopic display, or multiview display, a different number of views is displayed.