Abstract
Significant advances in video compression systems have been made in the past several decades to satisfy the near-exponential growth of Internet-scale video traffic. From the application perspective, we have identified three major functional blocks, including preprocessing, coding, and postprocessing, which have been continuously investigated to maximize the end-user quality of experience (QoE) under a limited bit rate budget. Recently, artificial intelligence (AI)-powered techniques have shown great potential to further increase the efficiency of the aforementioned functional blocks, both individually and jointly. In this article, we review recent technical advances in video compression systems extensively, with an emphasis on deep neural network (DNN)-based approaches, and then present three comprehensive case studies. On preprocessing, we show a switchable texture-based video coding example that leverages DNN-based scene understanding to extract semantic areas for the improvement of a subsequent video coder. On coding, we present an end-to-end neural video coding framework that takes advantage of the stacked DNNs to efficiently and compactly code input raw videos via fully data-driven learning. On postprocessing, we demonstrate two neural adaptive filters to, respectively, facilitate the in-loop and postfiltering for the enhancement of compressed frames. Finally, a companion website hosting the contents developed in this work can be accessed publicly at https://purdueviper.github.io/dnn-coding/.
Highlights
In recent years, Internet traffic has been dominated by a wide range of applications involving video, including video on demand (VOD), live streaming, and ultralow latency real-time communications
We present three case studies: 1) switchable texture-based video coding in preprocessing; 2) E2E-NVC; and 3) efficient neural filtering, to provide examples of the potential of deep neural network (DNN) to improve both subjective and objective efficiency over traditional video compression methodologies
OVERVIEWOFDNN - BASED VIDEO CODING A number of investigations have shown that DNNs can be used for efficient image/video coding [104]–[107]
Summary
Internet traffic has been dominated by a wide range of applications involving video, including video on demand (VOD), live streaming, and ultralow latency real-time communications. We discuss recent advances in preprocessing, coding, and postprocessing, with a particular emphasis on the use of DNN-based approaches for efficient video compression. We present three case studies: 1) switchable texture-based video coding in preprocessing; 2) E2E-NVC; and 3) efficient neural filtering, to provide examples of the potential of DNNs to improve both subjective and objective efficiency over traditional video compression methodologies. A number of deep learning-based prefiltering approaches have been adopted for targeted coding optimization These include denoising [29], [30], motion deblurring [31], [32], contrast enhancement [33], edge detection [34], [35], and so on. We limit our focus to saliency- and analysis-/synthesis-based approaches
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.