Abstract
Recently, 360° or omnidirectional videos have become increasingly popular for both personal and enterprize use-cases. However, 360° video streaming has very high bandwidth and processing requirements. State-of-the-art viewport-based streaming solutions lower these requirements by performing selective streaming based on long-term Field-of-View (FoV) prediction mechanisms. However, sometimes user movement is extremely unpredictable during some parts of the video, and applying these solutions adversely affects the overall quality of experience (QoE). This paper proposes a novel Combined Field-of-View tile-based adaptive streaming solution (CFOV) that improves end-user QoE for 360° video streaming. CFOV performs interactive tile selection based on more accurate dynamical viewing area identification by combining the results of two FoV prediction mechanisms. It also employs an innovative priority-based bitrate adaptation approach that ensures improved bitrate budget distribution between different tiles. We evaluate the proposed solution with a comprehensive set of experiments involving four immersive videos, diverse tiling patterns (i.e., 4×3, 6×4, and 8×6), different segment lengths (i.e., 1s, 2s, and 3s), and 48 empirical head movement traces under different bandwidth settings. The evaluation employs a newly defined QoE metric specifically introduced to assess the streaming performance of 360° videos objectively. The experimental findings show that, compared to alternative approaches, our proposed solution can achieve a higher viewport match and can significantly improve the user QoE for different watching behaviors and content characteristics.
Highlights
O MNIDIRECTIONAL 360◦ video is rapidly moving towards the mainstream mainly due to the recent developments in computing, display, and networking technologies
In order to overcome the limitations of existing solutions, this paper introduces a combined FoV prediction-assisted 360◦ video streaming approach (CFOV)
Qian et al [25] proposed a practical view-based streaming system for commodity devices named Flare. They compared the performance of naive, linear regression (LR), ridge regression (RR), and support vector regression (SVR) methods on 1300 head motion datasets collected from 130 users
Summary
O MNIDIRECTIONAL 360◦ video is rapidly moving towards the mainstream mainly due to the recent developments in computing, display, and networking technologies. Conventional bitrate adaptation heuristics [33]–[38] are not able to perform accurate content adjustment during tile-based streaming in the presence of highly variable and diverse factors (e.g., available bandwidth, user movement, segment sizes, etc.) or to make the best selection as the video segments are prepared in numerous tiles and encoding bitrates. Several existing tile-based adaptive streaming solutions either increase the viewport quality aggressively [27], [29], [39] or use a conservative approach [26], [30], [40] to maintain continuous video playback. The last section includes conclusive remarks and indicates possibilities for some potential future avenues
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have