Abstract

The latest Versatile Video Coding (VVC) standard achieves significant coding efficiency improvement over its predecessor, the High Efficiency Video Coding (HEVC) standard, but at the expense of considerably higher complexity. As measured on the VVC test model (VTM), intra-mode comparison and selection in the rate-distortion optimization (RDO) search consume most of the encoding time. In this paper, we propose a deep multi-task learning based fast intra-mode decision approach that adaptively prunes redundant modes. First, we create a large-scale intra-mode database for VVC, covering both conventional angular modes and the newly introduced tools, i.e., intra sub-partition (ISP) and matrix-based intra prediction (MIP). Next, we propose a multi-task intra-mode decision network (MID-Net) to predict the most probable angular modes and whether the ISP and MIP modes can be skipped. Then, a fast intra-coding workflow is designed accordingly, involving rough mode decision (RMD) acceleration and candidate mode list (CML) pruning. In this workflow, the learning-oriented probability and the statistics-oriented probability are synthesized to further improve prediction accuracy, ensuring that only unnecessary intra-modes are skipped. Finally, experimental results show that our approach reduces the encoding time of VVC intra-coding by 40.48% with negligible rate-distortion degradation, outperforming other state-of-the-art approaches.
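
The abstract describes synthesizing a learning-oriented probability with a statistics-oriented probability and then pruning the candidate mode list. The following is a minimal sketch of what such a decision step could look like; it is not the paper's actual method. The function names, the mixing weight alpha, the keep_ratio cutoff, and the skip_threshold for ISP/MIP are all hypothetical illustrations, since the abstract does not specify the synthesis rule or thresholds.

```python
import numpy as np

NUM_INTRA_MODES = 67  # VVC intra modes: Planar, DC, and 65 angular modes

def synthesize_probabilities(p_learn, p_stat, alpha=0.5):
    """Blend the learning-oriented probability (network output) with the
    statistics-oriented probability (e.g., mode frequencies of neighboring
    CUs). alpha is a hypothetical mixing weight, not from the paper."""
    return alpha * p_learn + (1.0 - alpha) * p_stat

def prune_candidate_mode_list(p_modes, keep_ratio=0.9):
    """Keep the fewest modes whose synthesized probability mass reaches
    keep_ratio; all remaining modes are skipped in the RDO search."""
    order = np.argsort(p_modes)[::-1]            # most probable first
    cum = np.cumsum(p_modes[order])
    n_keep = int(np.searchsorted(cum, keep_ratio)) + 1
    return order[:n_keep].tolist()

def fast_intra_mode_decision(p_learn, p_stat, p_skip_isp, p_skip_mip,
                             skip_threshold=0.8):
    """Hypothetical decision step: prune the angular CML, then decide
    whether ISP and MIP can be skipped for this coding unit."""
    p_modes = synthesize_probabilities(p_learn, p_stat)
    cml = prune_candidate_mode_list(p_modes)
    test_isp = p_skip_isp < skip_threshold       # skip ISP if confident
    test_mip = p_skip_mip < skip_threshold       # skip MIP if confident
    return cml, test_isp, test_mip

# Toy usage with random stand-ins for MID-Net outputs and CU statistics.
rng = np.random.default_rng(0)
p_learn = rng.dirichlet(np.ones(NUM_INTRA_MODES))
p_stat = rng.dirichlet(np.ones(NUM_INTRA_MODES))
cml, test_isp, test_mip = fast_intra_mode_decision(
    p_learn, p_stat, p_skip_isp=0.9, p_skip_mip=0.3)
print(f"CML size: {len(cml)}, test ISP: {test_isp}, test MIP: {test_mip}")
```

Blending the two probability sources, as the abstract notes, hedges against network misprediction: a mode the network under-rates but that is statistically common in the local context still survives pruning, so only genuinely unnecessary modes are skipped.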
