Transform-Based Feature Map Compression Method for Video Coding for Machines (VCM)

Minhun Lee,Jooyoung Lee,Seoung-Jun Oh,Younhee Kim,Se Yoon Jeong,Seungjin Park,Donggyu Sim

doi:10.3390/electronics12194042

Minhun Lee, Jooyoung Lee + Show 5 more

Open Access

https://doi.org/10.3390/electronics12194042

Copy DOI

Abstract

The burgeoning field of machine vision has led to the development by the Moving Picture Experts Group (MPEG) of a new type of compression technology called video coding for machines (VCM), to enhance machine recognition through video information compression. This research proposes a principal component analysis (PCA)-based compression methodology for multi-level feature maps extracted from the feature pyramid network (FPN) structure. Unlike current PCA-based studies that independently carry out PCA for each feature map, our approach employs a generalized basis matrix and mean vector derived from channel correlations by a generalized PCA process to eliminate the need for a PCA process. Further compression is achieved by amalgamating high-dimensional feature maps, capitalizing on the spatial redundancy within these multi-level feature maps. As a result, the proposed VCM encoder forgoes the PCA process, and the generalized data do not incur any compression loss. It only requires compressing the coefficients for each feature map using versatile video coding (VVC). Experimental results demonstrate superior performance by our method over all feature anchors for each machine vision task, as specified by the MPEG-VCM common test conditions, outperforming previous PCA-based feature map compression methods. Notably, it achieved an 89.3% BD-rate reduction for instance segmentation tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Transform-Based Feature Map Compression Method for Video Coding for Machines (VCM)

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Journal: Electronics	Publication Date: Sep 26, 2023
License type: CC BY 4.0

Similar Papers

Compression of Multiscale Features of FPN with Channel-Wise Reduction for VCM
Dong-Ha Kim ... Byung Tae Oh
Electronics | VOL. 12
Dong-Ha Kim, et. al.Dong-Ha Kim ... Byung Tae Oh
21 Jun 2023
Electronics | VOL. 12

A novel feature map compression method based on feature transformation for VCM
Minhun Lee ... Kwang-Deok Seo
-
Minhun Lee, et. al.Minhun Lee ... Kwang-Deok Seo
26 Mar 2023
26 Mar 2023

Fast Mode Decision Method of Multiple Weighted Bi-Predictions Using Lightweight Multilayer Perceptron in Versatile Video Coding
Taesik Lee ... Dongsan Jun
Electronics | VOL. 12
Taesik Lee, et. al.Taesik Lee ... Dongsan Jun
15 Jun 2023
Electronics | VOL. 12

Deep Image Compression Toward Machine Vision: A Unified Optimization Framework
Shurun Wang ... Zhao Wang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
Shurun Wang, et. al.Shurun Wang ... Zhao Wang
01 Jun 2023
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transform-Based Feature Map Compression Method for Video Coding for Machines (VCM)

Abstract

Talk to us

Similar Papers

More From: Electronics