Abstract

High efficiency video (HEVC) coding made its mark as a codec which compress with low bit rate than its preceding codec that is H.264, but the factor that stop HEVC from many applications is its complex encoding procedure. The rate distortion optimisation (RDO) cost calculation in HEVC consume complex calculations. In this paper, we propose a method to cross out the issue of complex calculations by replacing the traditional inter-prediction procedure of brute force search for RDO by a deep convolutional neural network to predict and perform this process. In the first step, the modelling of the deep depth decision algorithm is done with optimum specifications using convolutional neural network (CNN). In the next step, the model is designed and trained with dataset and validated. The trained model is tested by pipelining it to the original HEVC encoder to check its performance. We also evaluate the efficiency of the model by comparing the average time of encoding for various resolution video input. The testing is done with mutually independent input to maintain the accuracy of the system. The system shows a substantial saving in encoding time that proves the complexity reduction in HEVC.

Highlights

  • Video compression is an area to explore while considering the flourish of video acquisition devices, social media, live transfer of videos etc

  • The computational complexity of HEVC is a matter of discussion because of its Rate distortion optimisation [1] (RDO) cost calculation for coding tree unit [2] (CTU)

  • The discrete cosine transform (DCT) transform converts spatial domain into frequency domain, but it does not change the information in the block; if the DCT is calculated with perfect precision, the original block can be recovered clearly by applying the inverse discreet cosine transform” [5] “H.263 [7]is a popular video compression standard for low-bit-rate compressed format focusing on videoconferencing

Read more

Summary

Introduction

Video compression is an area to explore while considering the flourish of video acquisition devices, social media, live transfer of videos etc. The DCT transform converts spatial domain into frequency domain, but it does not change the information in the block; if the DCT is calculated with perfect precision, the original block can be recovered clearly by applying the inverse discreet cosine transform” [5] “H.263 [7]is a popular video compression standard for low-bit-rate compressed format focusing on videoconferencing. It was standardized by the organisation ITU-T, Video Coding Experts Group (VCEG) in 1995/1996”[4]. The aim behind AVC was to transfer video in low bit rate with better efficiency for UHD videos to its adaption of it

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call