Abstract

Significant advances in video compression systems have been made in the past several decades to satisfy the near-exponential growth of Internet-scale video traffic. From the application perspective, we have identified three major functional blocks, including preprocessing, coding, and postprocessing, which have been continuously investigated to maximize the end-user quality of experience (QoE) under a limited bit rate budget. Recently, artificial intelligence (AI)-powered techniques have shown great potential to further increase the efficiency of the aforementioned functional blocks, both individually and jointly. In this article, we review recent technical advances in video compression systems extensively, with an emphasis on deep neural network (DNN)-based approaches, and then present three comprehensive case studies. On preprocessing, we show a switchable texture-based video coding example that leverages DNN-based scene understanding to extract semantic areas for the improvement of a subsequent video coder. On coding, we present an end-to-end neural video coding framework that takes advantage of the stacked DNNs to efficiently and compactly code input raw videos via fully data-driven learning. On postprocessing, we demonstrate two neural adaptive filters to, respectively, facilitate the in-loop and postfiltering for the enhancement of compressed frames. Finally, a companion website hosting the contents developed in this work can be accessed publicly at https://purdueviper.github.io/dnn-coding/.

Highlights

  • In recent years, Internet traffic has been dominated by a wide range of applications involving video, including video on demand (VOD), live streaming, and ultralow latency real-time communications

  • We present three case studies: 1) switchable texture-based video coding in preprocessing; 2) E2E-NVC; and 3) efficient neural filtering, to provide examples of the potential of deep neural network (DNN) to improve both subjective and objective efficiency over traditional video compression methodologies

  • OVERVIEWOFDNN - BASED VIDEO CODING A number of investigations have shown that DNNs can be used for efficient image/video coding [104]–[107]

Read more

Summary

INTRODUCTION

Internet traffic has been dominated by a wide range of applications involving video, including video on demand (VOD), live streaming, and ultralow latency real-time communications. We discuss recent advances in preprocessing, coding, and postprocessing, with a particular emphasis on the use of DNN-based approaches for efficient video compression. We present three case studies: 1) switchable texture-based video coding in preprocessing; 2) E2E-NVC; and 3) efficient neural filtering, to provide examples of the potential of DNNs to improve both subjective and objective efficiency over traditional video compression methodologies. A number of deep learning-based prefiltering approaches have been adopted for targeted coding optimization These include denoising [29], [30], motion deblurring [31], [32], contrast enhancement [33], edge detection [34], [35], and so on. We limit our focus to saliency- and analysis-/synthesis-based approaches

Saliency-Based Video Preprocessing
Modularized Neural Video Coding
End-to-End Neural Video Coding
In-Loop Filtering
Postfiltering
CASESTUDYFOR PREPROCESSING
Texture Analysis
Experimental Results
Discussion and Future Direction
Framework
Neural Intracoding
Neural Motion Coding and Compensation
Neural Residual Coding
Experimental Comparison
CASESTUDIESFOR POSTPROCESSING
In-Loop Filtering via Guided CNN
Multiframe Postfiltering
Findings
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.