Abstract
RGB-D data, as homogeneous cross-modal data, exhibits strong correlations between its modalities. However, current research exploits cross-modal contextual information only in a unidirectional pattern, leaving bidirectional relationships unexplored in the compression field. We therefore propose a joint RGB-D compression scheme that combines Bi-directional Cross-modal Prior Transfer (Bi-CPT) modules with a Bi-directional Cross-modal Enhanced Entropy (Bi-CEE) model. The Bi-CPT module is designed for compact representations of cross-modal features, effectively eliminating spatial and modality redundancies at different granularity levels. In contrast to traditional entropy models, the proposed Bi-CEE model not only achieves spatial-channel contextual adaptation by partitioning RGB and depth features but also incorporates information from the other modality as a prior to improve the accuracy of probability estimation for the latent variables. Furthermore, the model enables parallel multi-stage processing to accelerate coding. Experimental results demonstrate the superiority of the proposed framework over existing compression schemes, both in rate-distortion performance and in downstream tasks, including surface reconstruction and semantic segmentation. The source code will be available at https://github.com/xyy7/Learning-based-RGB-D-Image-Compression .
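The abstract's core idea, latents of each modality split into slices whose entropy parameters are conditioned on already-decoded slices of both modalities, can be sketched as follows. This is a minimal illustration under assumptions, not the paper's implementation: the learned parameter network is replaced by a simple numpy stand-in (`entropy_params`), and the names `bidirectional_stage` and the two-slice channel split are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy latents for the two modalities: (channels, H, W).
y_rgb = rng.standard_normal((8, 4, 4))
y_depth = rng.standard_normal((8, 4, 4))

def entropy_params(context):
    """Stand-in for a learned network mapping context features to (mu, sigma)."""
    mu = context.mean(axis=0, keepdims=True)
    sigma = context.std(axis=0, keepdims=True) + 1e-6  # keep sigma positive
    return mu, sigma

def bidirectional_stage(decoded_a, decoded_b, spatial_shape):
    """Estimate entropy parameters for the next slice of modality A,
    conditioned on the already-decoded slices of BOTH modalities."""
    prior = decoded_a + decoded_b
    context = (np.concatenate(prior, axis=0) if prior
               else np.zeros((1,) + spatial_shape))
    return entropy_params(context)

# Decode stage by stage; within a stage the two modalities are
# independent of each other, so they could be processed in parallel.
slices = [slice(0, 4), slice(4, 8)]
decoded_rgb, decoded_depth = [], []
for sl in slices:
    mu_r, sig_r = bidirectional_stage(decoded_rgb, decoded_depth, (4, 4))
    mu_d, sig_d = bidirectional_stage(decoded_depth, decoded_rgb, (4, 4))
    # In a real codec, (mu, sigma) would parameterize a Gaussian used by
    # the arithmetic coder for this slice of each modality.
    decoded_rgb.append(y_rgb[sl])
    decoded_depth.append(y_depth[sl])
```

Because each stage conditions on slices from both modalities while the two modalities within a stage do not depend on each other, the per-stage parameter estimation can run in parallel, which is the multi-stage acceleration the abstract refers to.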
Published in: ACM Transactions on Multimedia Computing, Communications, and Applications