Abstract

Few-shot semantic segmentation (FSS) aims to segment the target foregrounds of query images using only a few labeled support samples. Compared with fully supervised methods, FSS generalizes better to unseen classes and reduces the burden of annotating large pixel-level datasets. To cope with complex outdoor lighting conditions, we introduce thermal infrared (T) images into the FSS task. However, existing RGB-T FSS methods fuse the two modalities directly and ignore the differences between them, which may hinder cross-modal information interaction. Also considering the effect of successive downsampling on the results, we propose a bi-directional modality difference elimination network (BMDENet) to boost segmentation performance. Concretely, the bi-directional modality difference elimination module (BMDEM) reduces the heterogeneity between RGB and thermal images in the prototype space. The residual attention fusion module (RAFM) mines the bimodal features to fully fuse the cross-modal information. In addition, the mainstay and subsidiary enhancement module (MSEM) enhances the fused features to address the remaining limitations of the above design. Extensive experiments on the Tokyo Multi-Spectral-4i dataset show that BMDENet achieves state-of-the-art results under both 1- and 5-shot settings.
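The abstract does not give implementation details for these modules, but the general idea of residual attention fusion of two modalities can be illustrated. Below is a minimal, hypothetical PyTorch sketch of such a fusion block; the class name (RAFMSketch), layer choices, and channel layout are assumptions for illustration only, not the paper's actual RAFM.

```python
# Hypothetical sketch of residual attention fusion for RGB and thermal
# feature maps. Not the paper's code; names and layers are assumptions.
import torch
import torch.nn as nn

class RAFMSketch(nn.Module):
    """Illustrative residual attention fusion of two modality features."""
    def __init__(self, channels: int):
        super().__init__()
        # Channel attention computed over the concatenated bimodal features.
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, 2 * channels, kernel_size=1),
            nn.Sigmoid(),
        )
        # Project the reweighted bimodal features back to one modality width.
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, rgb_feat: torch.Tensor,
                thermal_feat: torch.Tensor) -> torch.Tensor:
        x = torch.cat([rgb_feat, thermal_feat], dim=1)
        # Reweight cross-modal channels, then add residual paths so each
        # modality's original signal is preserved in the fused output.
        fused = self.fuse(x * self.attn(x))
        return fused + rgb_feat + thermal_feat

if __name__ == "__main__":
    rgb = torch.randn(2, 64, 32, 32)
    thr = torch.randn(2, 64, 32, 32)
    out = RAFMSketch(64)(rgb, thr)
    print(out.shape)  # torch.Size([2, 64, 32, 32])
```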
