Abstract

Versatile Video Coding (VVC), a.k.a. ITU-T H.266 | ISO/IEC 23090-3, is the new generation video coding standard that has just been finalized by the Joint Video Experts Team (JVET) of ITU-T VCEG and ISO/IEC MPEG at its <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$19^{\mathrm {th}}$ </tex-math></inline-formula> meeting ending on July 1, 2020. This paper gives an overview of the VVC high-level syntax (HLS), which forms its system and transport interface. Comparisons to the HLS designs in High Efficiency Video Coding (HEVC) and Advanced Video Coding (AVC), the previous major video coding standards, are included. When discussing new HLS features introduced into VVC or differences relative to HEVC and AVC, the reasoning behind the design differences and the benefits they bring are described. The HLS of VVC enables newer and more versatile use cases such as video region extraction, composition and merging of content from multiple coded video bitstreams, and viewport-adaptive 360° immersive media.

Highlights

  • C ODED video content consists of a series of data structures that contain header syntax and supplemental information in addition to the compressed bits that directly represent color component samples

  • With the use of a Picture Header (PH) in Versatile Video Coding (VVC), the picture order count (POC) least significant bits (LSBs), which are used for deriving the POC value and have the same value for all slices of a picture, are signaled in the PH, as contrasted with High Efficiency Video Coding (HEVC) where they are signaled in the Slice Header (SH)

  • VVC is the first video coding standard that includes subpictures as a feature, which can provide a functionality that was previously specified in a version 2 extension of HEVC using metadata and encoder constraints for what is known as motion-constrained tile sets (MCTSs), but is designed in a different way to have better coding efficiency and to be friendlier for usage in application systems

Read more

Summary

INTRODUCTION

C ODED video content consists of a series of data structures that contain header syntax and supplemental information in addition to the compressed bits that directly represent color component samples The handling of these data structures forms the system interface for operation of an encoder or decoder within a system environment, and this interface needs to support the functionalities that will be used by the system to enable the features of the application. VVC inherited much of its HLS design from the preceding AVC and HEVC standards These include the NAL-unitbased syntax structure, the hierarchical syntax and data unit structure, the VUI and SEI message mechanism, and the video buffering model based on a hypothetical reference decoder (HRD). More details on these can be found in [19]–[21]

Bitstream Structure
NAL Units
Random Access Support
Reference Picture Management
Temporal Scalability Support
Externally Specified Sub-Profiles
NEW NAL UNITS AND SYNTAX STRUCTURES
REFERENCE PICTURE MANAGEMENT
Reference Picture List Signaling and Reference Picture Marking
Slices
Subpictures
Order of VCL NAL Units Within a Picture
PICTURE RESOLUTION CHANGE WITHIN A SEQUENCE
MULTI-LAYER AND SCALABILITY SUPPORT
Alternative Timing for Random Access
Bitstream Splicing
HRD for Temporal Sublayers
General Sub-Bitstream Extraction Processes
HRD for Subpictures
VUI AND SEI
XIII. CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.