Machine Learning Based Efficient QT-MTT Partitioning Scheme for VVC Intra Encoders

Alexandre Tissier, Souhaiel Belhadj Dit Mdalsi, Daniel Menard, Wassim Hamidouche, Jarno Vanne, Franck Galpin

doi:10.1109/tcsvt.2022.3232385

Abstract

The next-generation Versatile Video Coding (VVC) standard introduces a new Multi-Type Tree (MTT) block partitioning structure that supports Binary-Tree (BT) and Ternary-Tree (TT) splits in both vertical and horizontal directions. This new approach leads to five possible splits at each block depth. It thereby improves the coding efficiency of VVC over that of the preceding High Efficiency Video Coding (HEVC) standard, which only supports Quad-Tree (QT) partitioning with a single split per block depth. However, MTT has also considerably increased encoder computational complexity. This paper proposes a two-stage learning-based technique to tackle the complexity overhead of MTT in VVC intra encoders. In our scheme, the input block is first processed by a Convolutional Neural Network (CNN) to predict its spatial features through a vector of probabilities describing the partition at each 4×4 edge. Subsequently, a Decision Tree (DT) model leverages this vector of spatial features to predict the most likely splits at each block. Finally, based on this prediction, only the N most likely splits are processed by the Rate-Distortion (RD) process of the encoder. In order to train our CNN and DT models on a wide range of image contents, we also propose a public VVC frame partitioning dataset based on an existing image dataset encoded with the VVC reference software encoder. Our solution relying on the top-3 configuration reaches 47.4% complexity reduction for a negligible bitrate increase of 0.79%. A top-2 configuration enables a higher complexity reduction of 70.4% for 2.49% bitrate loss. These results demonstrate a better trade-off between VTM intra-coding efficiency and complexity reduction compared to the state-of-the-art solutions. The source code of the proposed method and the training dataset are made publicly available on GitHub.
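
The final stage described in the abstract, in which only the N most likely splits reach the RD process, can be illustrated with a minimal sketch. This is not the authors' implementation: the split labels, the function names `top_n_splits` and `pruned_rd_search`, and the probability and cost values are all hypothetical, and the per-split probabilities are assumed to be already available from the DT stage.

```python
from typing import Callable, Dict, List

# Hypothetical labels for the five split types named in the abstract.
SPLITS = ("QT", "BT_H", "BT_V", "TT_H", "TT_V")


def top_n_splits(split_probs: Dict[str, float], n: int) -> List[str]:
    """Return the n split modes with the highest predicted probability."""
    if n < 1:
        raise ValueError("n must be at least 1")
    ranked = sorted(split_probs, key=lambda s: split_probs[s], reverse=True)
    return ranked[:n]


def pruned_rd_search(
    split_probs: Dict[str, float],
    rd_cost: Callable[[str], float],
    n: int,
) -> str:
    """Evaluate the RD cost of only the top-n predicted splits and
    return the cheapest one. rd_cost stands in for the encoder's
    (expensive) rate-distortion evaluation of a split."""
    candidates = top_n_splits(split_probs, n)
    return min(candidates, key=rd_cost)


# Toy values, invented for illustration only.
probs = {"QT": 0.05, "BT_H": 0.40, "BT_V": 0.10, "TT_H": 0.30, "TT_V": 0.15}
toy_costs = {"QT": 120.0, "BT_H": 95.0, "BT_V": 90.0, "TT_H": 92.0, "TT_V": 101.0}

chosen_top3 = pruned_rd_search(probs, lambda s: toy_costs[s], n=3)
chosen_full = pruned_rd_search(probs, lambda s: toy_costs[s], n=len(SPLITS))
print(chosen_top3, chosen_full)
```

In this toy case the top-3 search evaluates three splits instead of five and selects `TT_H`, while the exhaustive search finds the slightly cheaper `BT_V`. That gap is the kind of trade-off between complexity reduction and bitrate loss that the top-3 and top-2 figures in the abstract quantify.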

Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication Date: Aug 1, 2023
Citations: 19

Similar Papers

Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding
Aolin Feng ... Feng Wu
IEEE Transactions on Image Processing | VOL. 32
01 Jan 2023

Tunable VVC Frame Partitioning based on Lightweight Machine Learning
Thomas Amestoy ... Cyril Bergeron
IEEE Transactions on Image Processing | VOL. 29
06 Sep 2019

CNN-based Partitioning Structure Prediction for VVC Intra Speedup: Bottom-Up-based and Top-Down-based
Yue Li ... Li Zhang
28 May 2022

Medical Image Compression Method Using Lightweight Multi-Layer Perceptron for Mobile Healthcare Applications
Taesik Lee ... Kugjin Yun
Computers, Materials & Continua | VOL. 70
01 Jan 2021
