Efficient Algorithm Adaptations and Fully Parallel Hardware Architecture of H.265/HEVC Intra Encoder

Yuanzhi Zhang,Chao Lu

doi:10.1109/tcsvt.2018.2878399

Abstract

The growing demand for high-performance ultra-high-definition video coding leads to H.265/high-efficiency video coding (HEVC), where the increased computational complexity and data/timing dependence hinder its coding throughput. To address these challenges, this paper presents four algorithm adaptations and a fully parallel hardware architecture for an H.265/HEVC intra encoder. To the best of our knowledge, this is the first fully parallel H.265/HEVC intra encoder. This design supports 35 prediction modes and all coding tree unit partitions. All PUs are independently processed in four prediction engines for high parallelism. An appropriate set of intra prediction modes, RDO candidates, and CABAC rate estimate instances is assigned to each prediction engine, where internal computational tasks are pipelined and scheduled to maximize the processing throughput. Compared with the HM-15.0 software, the proposed algorithm adaptations lead to a reduction of 27% in computational workload, while the average BD-rate and BD-PSNR are 4.39% and -0.21 dB, respectively. This BD-rate is lower than the existing designs with the same video resolution. FPGA implementation of the proposed design shows that it operates at 120 MHz and supports 45 fps of 1080P video sequences using 201-K logic elements and 120-KB on-chip SRAM. ASIC implementation of the proposed design in TSMC 90-nm technology shows that its clock frequency reaches 320 MHz with a hardware gate count of 2288 K, and that it supports real-time encoding of 30 fps of 4-K video sequences. Compared with the state-of-the-art designs, our proposed design demonstrates advantages in computational complexity, bit rate, video quality, throughput, reliability, and flexibility.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Algorithm Adaptations and Fully Parallel Hardware Architecture of H.265/HEVC Intra Encoder

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society

Lead the way for us

Journal: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society	Publication Date: Nov 1, 2019
Citations: 44

Similar Papers

The impact of bitrate and GOP pattern on the video quality of H.265/HEVC compression standard
Jinheng Xu ... Weizheng Jin
-
Jinheng Xu, et. al.Jinheng Xu ... Weizheng Jin
01 Sep 2018
01 Sep 2018

Fast CU Partition Decision Based on Texture for H.266/VVC
Qiuwen Zhang ... Antonio J Peña
Scientific programming | VOL. 2021
Qiuwen Zhang, et. al.Qiuwen Zhang ... Antonio J Peña
24 May 2021
Scientific programming | VOL. 2021

An improved R-λ rate control model based on joint spatial-temporal domain information and HVS characteristics
Zeming Zhao ... Feiran Zhang
Multimedia Tools and Applications | VOL. 80
Zeming Zhao, et. al.Zeming Zhao ... Feiran Zhang
02 Sep 2020
Multimedia Tools and Applications | VOL. 80

Context adaptive mode sorting for fast HEVC mode decision
S G Blasi ... E M Hung
-
S G Blasi, et. al.S G Blasi ... E M Hung
01 Sep 2015
01 Sep 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Algorithm Adaptations and Fully Parallel Hardware Architecture of H.265/HEVC Intra Encoder

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society