Learned scalable video coding for humans and machines

Hadi Hadizadeh,Ivan V Bajić

doi:10.1186/s13640-024-00657-w

Abstract

Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep neural networks (DNNs), encoded video is increasingly being used for automatic video analytics performed by machines. In applications such as automatic traffic monitoring, analytics such as vehicle detection, tracking and counting, would run continuously, while human viewing could be required occasionally to review potential incidents. To support such applications, a new paradigm for video coding is needed that will facilitate efficient representation and compression of video for both machine and human use in a scalable manner. In this manuscript, we introduce an end-to-end learnable video codec that supports a machine vision task in its base layer, while its enhancement layer, together with the base layer, supports input reconstruction for human viewing. The proposed system is constructed based on the concept of conditional coding to achieve better compression gains. Comprehensive experimental evaluations conducted on four standard video datasets demonstrate that our framework outperforms both state-of-the-art learned and conventional video codecs in its base layer, while maintaining comparable performance on the human vision task in its enhancement layer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learned scalable video coding for humans and machines

Abstract

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing

Lead the way for us

Journal: EURASIP Journal on Image and Video Processing	Publication Date: Nov 14, 2024
License type: CC BY 4.0

Similar Papers

Scalable combined H.264/distributed video coding
Rui Lv ... Tianquan James Deng
COMPEL - The international journal for computation and mathematics in electrical and electronic engineering | VOL. 29
Rui Lv, et. al.Rui Lv ... Tianquan James Deng
09 Mar 2010
COMPEL - The international journal for computation and mathematics in electrical and electronic engineering | VOL. 29

A Streaming Method for Efficient Bandwidth Utilization Using QoS Control Function of LTE
Yasuhiro Nagai ... Takao Okamawari
-
Yasuhiro Nagai, et. al.Yasuhiro Nagai ... Takao Okamawari
01 May 2016
01 May 2016

Rate control for fully fine-grained scalable video coders
Josep Prades-Nebot ... Edward J Delp Iii
-
Josep Prades-Nebot, et. al.Josep Prades-Nebot ... Edward J Delp Iii
07 Jan 2002
07 Jan 2002

Objective quality definition of scalable video coding and its application for optimal streaming of FGS-coded videos
Yuanqing He ... Tianyun Huang
Computer Communications | VOL. 32
Yuanqing He, et. al.Yuanqing He ... Tianyun Huang
12 Sep 2008
Computer Communications | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learned scalable video coding for humans and machines

Abstract

Talk to us

Similar Papers

More From: EURASIP Journal on Image and Video Processing