Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding.

Wenrui Dai, Yangmei Shen, Hongkai Xiong, Chang Wen Chen, Xin Tang, Junni Zou

doi:10.1109/tip.2016.2594490

https://doi.org/10.1109/tip.2016.2594490

Abstract

Classical dictionary learning methods for video coding suffer from high computational complexity, and their coding efficiency is impaired because they disregard the underlying data distribution. This paper proposes a spatio-temporal online dictionary learning (STOL) algorithm that speeds up the convergence of dictionary learning with a guaranteed approximation error. The proposed algorithm incorporates stochastic gradient descent to form a dictionary of pairs of 3D low-frequency and high-frequency spatio-temporal volumes. In each iteration of the learning process, it randomly selects one sample volume and updates the dictionary atoms by minimizing the expected cost, rather than optimizing the empirical cost over the complete training data as batch learning methods such as K-SVD do. Because the selected volumes are assumed to be independent and identically distributed samples from the underlying distribution, the decomposition coefficients obtained from the trained dictionary are well suited to sparse representation. It is proved theoretically that STOL can achieve a better approximation for sparse representation than K-SVD while maintaining both structured sparsity and hierarchical sparsity. STOL is shown to outperform batch gradient descent methods (K-SVD) in convergence speed and computational complexity, and the upper bound on its prediction error is asymptotically equal to the training error. Extensive experiments validate that the STOL-based coding scheme, at lower computational complexity, improves on H.264/AVC or High Efficiency Video Coding, as well as existing super-resolution-based methods, in rate-distortion performance and visual quality.
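
The single-sample update described in the abstract can be illustrated with a generic online dictionary learning loop. The following is a minimal sketch and not the STOL algorithm itself: the abstract does not give the cost function, the sparse coder, or the step-size schedule, so the l1-penalized least-squares cost, the ISTA sparse coder, the 1/sqrt(t) step size, and the names `sparse_code_ista` and `online_dictionary_sgd` are all assumptions made for illustration. Samples are treated as flattened vectors (columns of `X`) rather than paired low-frequency and high-frequency 3D volumes.

```python
import numpy as np


def sparse_code_ista(D, x, lam=0.1, n_steps=50):
    """Approximate argmin_a 0.5*||x - D a||^2 + lam*||a||_1 with ISTA.

    Assumed sparse coder for this sketch; the paper may use a different one.
    """
    # Step size 1/L, where L = ||D||_2^2 bounds the gradient's Lipschitz constant.
    L = np.linalg.norm(D, 2) ** 2
    a = np.zeros(D.shape[1])
    for _ in range(n_steps):
        grad = D.T @ (D @ a - x)
        z = a - grad / L
        # Soft-thresholding is the proximal step for the l1 penalty.
        a = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)
    return a


def online_dictionary_sgd(X, n_atoms, n_iters=500, lam=0.1, lr=0.1, seed=0):
    """Learn a dictionary from the columns of X using one random sample per step."""
    rng = np.random.default_rng(seed)
    dim, n_samples = X.shape
    D = rng.standard_normal((dim, n_atoms))
    D /= np.linalg.norm(D, axis=0, keepdims=True)  # start from unit-norm atoms
    for t in range(1, n_iters + 1):
        x = X[:, rng.integers(n_samples)]   # draw a single training sample
        a = sparse_code_ista(D, x, lam)     # sparse code with D held fixed
        residual = D @ a - x
        # Stochastic gradient of 0.5*||x - D a||^2 with respect to D.
        D -= (lr / np.sqrt(t)) * np.outer(residual, a)
        # Project each atom back onto the unit ball to prevent scale drift.
        D /= np.maximum(np.linalg.norm(D, axis=0, keepdims=True), 1.0)
    return D
```

Each iteration touches one sample only, so its cost does not grow with the size of the training set. A batch method such as K-SVD, by contrast, revisits the complete training data in every pass.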

Journal: IEEE Transactions on Image Processing: a publication of the IEEE Signal Processing Society
Publication Date: Jul 27, 2016
Citations: 11

Similar Papers

Fast Motion Estimation Based on Confidence Interval
Nan Hu ... En-Hui Yang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 24
01 Aug 2014

Performance Comparison of Emerging EVC and VVC Video Coding Standards with HEVC and AV1
Dan Grois ... Alex Giladi
SMPTE Motion Imaging Journal | VOL. 130
01 May 2021

Multiple classifier-based fast coding unit partition for intra coding in future video coding
Zongju Peng ... Mei Yu
Signal Processing: Image Communication | VOL. 78
02 Jul 2019

VLSI architectures design for encoders of High Efficiency Video Coding (HEVC) standard
01 Jan 2015
