Spatial scalable video service has surged in the time of multiple screens. Existing bitrate allocation methods are principled by rate-distortion theory and characterized by iterative encoding, which is accurate yet complex. However, the quasi-quantitative description is preferred in practice of broadcasting. In this paper, we propose a task of bitrate estimation for scalable videos concerning the content, aiming at a more efficient model at the cost of precision. First, we exhibit necessity to build a model for Scalable High Efficiency Video Coding (SHVC) and quantitative relation between video content and bitrate using different encoders. Then, a scalable-video dataset is prepared. It covers various types of content to offer diversity for model training. In the end, multi-linear regression is utilized to estimate the bitrate of scalable videos, with spatial and temporal indices as explanatory variables. Our statistical experiments show the model is able to estimate bitrate after trained on the self-built dataset.
Read full abstract