Abstract
Versatile Video Coding (VVC), a.k.a. ITU-T H.266 | ISO/IEC 23090-3, is the new generation video coding standard that has just been finalized by the Joint Video Experts Team (JVET) of ITU-T VCEG and ISO/IEC MPEG at its <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$19^{\mathrm {th}}$ </tex-math></inline-formula> meeting ending on July 1, 2020. This paper gives an overview of the VVC high-level syntax (HLS), which forms its system and transport interface. Comparisons to the HLS designs in High Efficiency Video Coding (HEVC) and Advanced Video Coding (AVC), the previous major video coding standards, are included. When discussing new HLS features introduced into VVC or differences relative to HEVC and AVC, the reasoning behind the design differences and the benefits they bring are described. The HLS of VVC enables newer and more versatile use cases such as video region extraction, composition and merging of content from multiple coded video bitstreams, and viewport-adaptive 360° immersive media.
Highlights
C ODED video content consists of a series of data structures that contain header syntax and supplemental information in addition to the compressed bits that directly represent color component samples
With the use of a Picture Header (PH) in Versatile Video Coding (VVC), the picture order count (POC) least significant bits (LSBs), which are used for deriving the POC value and have the same value for all slices of a picture, are signaled in the PH, as contrasted with High Efficiency Video Coding (HEVC) where they are signaled in the Slice Header (SH)
VVC is the first video coding standard that includes subpictures as a feature, which can provide a functionality that was previously specified in a version 2 extension of HEVC using metadata and encoder constraints for what is known as motion-constrained tile sets (MCTSs), but is designed in a different way to have better coding efficiency and to be friendlier for usage in application systems
Summary
C ODED video content consists of a series of data structures that contain header syntax and supplemental information in addition to the compressed bits that directly represent color component samples The handling of these data structures forms the system interface for operation of an encoder or decoder within a system environment, and this interface needs to support the functionalities that will be used by the system to enable the features of the application. VVC inherited much of its HLS design from the preceding AVC and HEVC standards These include the NAL-unitbased syntax structure, the hierarchical syntax and data unit structure, the VUI and SEI message mechanism, and the video buffering model based on a hypothetical reference decoder (HRD). More details on these can be found in [19]–[21]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: IEEE Transactions on Circuits and Systems for Video Technology
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.