Architecture of a Low Latency H.264/AVC Video Codec for Robust ML based Image Classification

Fritjof Steinert,Benno Stabernack

doi:10.1007/s11265-021-01727-2

Abstract

The use of neural networks is considered as the state of the art in the field of image classification. A large number of different networks are available for this purpose, which, appropriately trained, permit a high level of classification accuracy.Typically, these networks are applied to uncompressed image data, since a corresponding training was also carried out using image data of similar high quality. However, if image data contains image errors, the classification accuracy deteriorates drastically. This applies in particular to coding artifacts which occur due to image and video compression. Typical application scenarios for video compression are narrowband transmission channels for which video coding is required but a subsequent classification is to be carried out on the receiver side. In this paper we present a special H.264/Advanced Video Codec (AVC) based video codec that allows certain regions of a picture to be coded with near constant picture quality in order to allow a reliable classification using neural networks, whereas the remaining image will be coded using constant bit rate. We have combined this feature with the ability to run with lowest latency properties, which is usually also required in remote control applications scenarios. The codec has been implemented as a fully hardwired High Definition video capable hardware architecture which is suitable for Field Programmable Gate Arrays.

Highlights

Our application scenario is focused on the detection of persons floating in water as a first step in a successful sea rescue operation
In this paper we have described the architecture of a low complexity H.264/Advanced Video Codec (AVC) video codec, which has been optimized regarding its usage in technical image processing applications
We have focused on two major issues arising from using video compression in these applications, which are mainly providing low latency and avoiding compression artifacts

Summary

Introduction

Our application scenario is focused on the detection of persons floating in water (shipwrecked) as a first step in a successful sea rescue operation. For this purpose an unmanned aerial vehicle (UAV) with a full HD (High Definition) camera pointing downwards is used to record the water surface. The intended UAV has a long flight duration and can fly up to 100 km away from the ground station. The video data is transmitted to the ground station via a radio link with a very limited bandwidth,. An HD camera which captures the scene. 2. A Region of Interest (ROI) generation based on the captured scene e.g. using simple ML methods. 3. Encoding the video with the recognized ROIs. 4. Transmission of the bitstream over a limited wireless air link to the ground station. 5. Decoding the video at the ground station

Region of Interest generation 3 encoder

Influence of Video Coding Quality on Image Processing Algorithms

Video Coding Latency

Hardware Architectures for Video Coding

Encoder

Rate Control

Region of Interest

External Memory Requirements

Decoder

Codec Latency

Results

System Integration

Conclusion and Future Work

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Signal Processing Systems	Publication Date: Jan 31, 2022
Citations: 4	License type: open-access

R Discovery Prime

R Discovery Prime

Architecture of a Low Latency H.264/AVC Video Codec for Robust ML based Image Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Signal Processing Systems

Lead the way for us

Similar Papers

Architecture of a Low Latency H.264/AVC Video Codec for robust ML based Image Classification
Benno Stabernack ... Fritjof Steinert
-
Benno Stabernack, et. al.Benno Stabernack ... Fritjof Steinert
18 Jan 2021
18 Jan 2021

CCITT H.261 compatible mixed bit rate coding of video for ATM networks
S Minami
-
S MinamiS Minami
14 Jun 1992
14 Jun 1992

Encoder-based rate smoothing and quality control for low-delay video coding and communication
Zhihai He ... Chang W Chen
-
Zhihai He, et. al.Zhihai He ... Chang W Chen
07 Jan 2002
07 Jan 2002

Comparison of lossless video and image compression codecs for medical computed tomography datasets
Vy Bui ... Lin-Ching Chang
-
Vy Bui, et. al.Vy Bui ... Lin-Ching Chang
01 Dec 2016
01 Dec 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Architecture of a Low Latency H.264/AVC Video Codec for Robust ML based Image Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Signal Processing Systems