Abstract

The ability to capture the complementary information in multi-modality data is critical to the development of multi-modality salient object detection (SOD). Most existing studies attempt to integrate multi-modality information through various fusion strategies. However, most of these methods ignore the inherent differences between modalities, resulting in poor performance in challenging scenarios. In this paper, we propose a novel Modality-Induced Transfer-Fusion Network (MITF-Net) for RGB-D and RGB-T SOD that fully exploits the complementarity of multi-modality data. Specifically, we first deploy a modality transfer fusion (MTF) module to bridge the semantic gap between single- and multi-modality data and to mine cross-modality complementarity based on point-to-point structural similarity. We then design a cycle-separated attention (CSA) module that recurrently refines cross-layer information and measures the effectiveness of cross-layer features through point-wise convolution-based multi-scale channel attention. Furthermore, we refine object boundaries in the decoding stage to obtain high-quality saliency maps with sharp boundaries. Extensive experiments on 13 RGB-D and RGB-T SOD datasets show that the proposed MITF-Net achieves strong, competitive performance.
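To make the "point-wise convolution-based multi-scale channel attention" idea more concrete, the following is a minimal PyTorch sketch of one common way such attention is built: a local branch and a globally pooled branch, each formed from 1x1 (point-wise) convolutions, jointly producing channel weights. The module name, branch structure, and reduction ratio here are illustrative assumptions, not the authors' exact design.

import torch
import torch.nn as nn

class MultiScaleChannelAttention(nn.Module):
    """Illustrative sketch (hypothetical, not the paper's exact module):
    point-wise-convolution multi-scale channel attention with a local
    branch and a global (pooled) branch whose sum gates the input."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        mid = max(channels // reduction, 1)
        # Local context: point-wise convs applied at every spatial position.
        self.local_branch = nn.Sequential(
            nn.Conv2d(channels, mid, kernel_size=1),
            nn.BatchNorm2d(mid),
            nn.ReLU(inplace=True),
            nn.Conv2d(mid, channels, kernel_size=1),
            nn.BatchNorm2d(channels),
        )
        # Global context: pool to 1x1, then the same point-wise bottleneck.
        self.global_branch = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, mid, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(mid, channels, kernel_size=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Fuse the two scales and squash to (0, 1) attention weights.
        w = torch.sigmoid(self.local_branch(x) + self.global_branch(x))
        return x * w

In a fusion network of this kind, such a module would typically be applied to combined cross-layer or cross-modality features so that more reliable channels are up-weighted before decoding; the exact placement in MITF-Net is described in the full paper.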
