Abstract

The recent scalable video coding (SVC) extension to the H.264/AVC video coding standard has unprecedented compression efficiency while supporting a wide range of scalability modes, including temporal, spatial, and quality (SNR) scalability, as well as combined spatiotemporal SNR scalability. The traffic characteristics, especially the bit rate variabilities, of the individual layer streams critically affect their network transport. We study the SVC traffic statistics, including the bit rate distortion and bit rate variability distortion, with long CIF resolution video sequences and compare them with the corresponding MPEG-4 Part 2 traffic statistics. We consider (i) temporal scalability with three temporal layers, (ii) spatial scalability with a QCIF base layer and a CIF enhancement layer, as well as (iii) quality scalability modes FGS and MGS. We find that the significant improvement in RD efficiency of SVC is accompanied by substantially higher traffic variabilities as compared to the equivalent MPEG-4 Part 2 streams. We find that separately analyzing the traffic of temporal-scalability only encodings gives reasonable estimates of the traffic statistics of the temporal layers embedded in combined spatiotemporal encodings and in the base layer of combined FGS-temporal encodings. Overall, we find that SVC achieves significantly higher compression ratios than MPEG-4 Part 2, but produces unprecedented levels of traffic variability, thus presenting new challenges for the network transport of scalable video.

Highlights

  • We study the video traffic generated by the scalable video coding (SVC) extension [1, 2] of the H.264/MPEG-4 advanced video coding standard [3] (H.264 SVC for brevity)

  • In the first column of the table, the encoder quantization scales are specified for MPEG-4 fine granularity scalability (FGS) (Mp4f) and SVC fine granularity scalability (SVC FGS) (SVF)

  • We examined the video traffic characteristics of the temporal, spatial, and FGS scalability modes of the scalable video coding (SVC) extension of the H.264/AVC standard and Traffic variability (CoV) Traffic variability (CoV)

Read more

Summary

INTRODUCTION

We study the video traffic generated by the scalable video coding (SVC) extension [1, 2] of the H.264/MPEG-4 advanced video coding standard [3] (H.264 SVC for brevity). SVC-encoded scalable video is for the first time examined in the present study. Existing studies of the H.264/AVC codec and its SVC extension, such as [3, 65, 66], focus primarily on the bit rate-distortion (RD) performance, that is, the video quality (PSNR) as a function of the average bit rate, and typically consider only short video sequences up to a few hundred frames. In order to obtain reliable and meaningful statistical estimates of the traffic variability and other properties, it is necessary to examine long video sequences with several thousand frames, as we do in this study. SVC scalability extension is expected to play a major role in providing video services over heterogeneous networks due to the significantly improved rate-distortion efficiency of the H.264. We briefly discuss the main scalability modes of this new H.264 SVC scalability amendment and refer to [2] for detailed information

Temporal scalability with hierarchical B frames
Spatial scalability
Combined scalability
Video sequences
Encoding tools
Encoding setup
Video traffic metrics
TEMPORAL SCALABILITY TRAFFIC ANALYSIS
Temporal layer basics
Results and discussion
SPATIAL SCALABILITY TRAFFIC ANALYSIS
Spatial layer basics
FINE GRANULAR SCALABILITY TRAFFIC ANALYSIS
FGS layer basics
MEDIUM GRAIN SCALABILITY TRAFFIC ANALYSIS
MGS layer basics
COMBINED SCALABILITY TRAFFIC ANALYSIS
Combined spatiotemporal scalability
Combined FGS-temporal scalability
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call