Delivery of omnidirectional video using saliency prediction and optimal bitrate allocation

Cagri Ozcinar,Weimin Wang,Aljosa Smolic,Nevrez İmamoğlu

doi:10.1007/s11760-020-01769-2

Cagri Ozcinar, Weimin Wang + Show 2 more

Open Access

https://doi.org/10.1007/s11760-020-01769-2

Copy DOI

Abstract

In this work, we propose and investigate a user-centric framework for the delivery of omnidirectional video (ODV) on VR systems by taking advantage of visual attention (saliency) models for bitrate allocation module. For this purpose, we formulate a new bitrate allocation algorithm that takes saliency map and nonlinear sphere-to-plane mapping into account for each ODV and solve the formulated problem using linear integer programming. For visual attention models, we use both image- and video-based saliency prediction results; moreover, we explore two types of attention model approaches: (i) salient object detection with transfer learning using pre-trained networks, (ii) saliency prediction with supervised networks trained on eye-fixation dataset. Experimental evaluations on saliency integration of models are discussed with interesting findings on transfer learning and supervised saliency approaches.

Full Text