Abstract

Exploring more effective multimodal fusion strategies remains challenging for RGB-T salient object detection (SOD). Most RGB-T SOD methods focus on acquiring complementary modal features from foreground information while ignoring the importance of background information for salient object localization. In addition, feature fusion without information filtering may introduce more noise. To address these problems, this paper proposes a new cross-modal interaction guidance network (CIGNet) for RGB-T salient object detection. Specifically, we construct a transformer-based dual-stream encoder to extract multimodal features. In the decoder, we propose an attention-based modal information complementary module (MICM) that captures cross-modal complementary information for global comparison and salient object localization. Based on the MICM features, we design a multi-scale adaptive fusion module (MAFM) to identify the optimal salient regions during multi-scale fusion and reduce redundant features. To enhance the completeness of salient features after multi-scale fusion, we further propose a saliency region mining module (SRMM), which corrects features in the boundary neighborhood by exploiting the differences between foreground pixels, background pixels, and the boundary. In comparisons with other state-of-the-art methods on three RGB-T datasets and five RGB-D datasets, the experimental results demonstrate the superiority and generalizability of the proposed CIGNet.
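The abstract does not give implementation details, so the following PyTorch-style snippet is only a minimal, illustrative sketch of the general idea behind an attention-based cross-modal complementary block such as MICM: each modality attends to the other to pick up complementary cues before fusion. All class, parameter, and layer choices here are assumptions for illustration, not the authors' actual module.

```python
# Illustrative sketch only: a hypothetical attention-based cross-modal
# complementary block, loosely following the MICM idea described above.
# Names and design choices are assumptions, not taken from the paper.
import torch
import torch.nn as nn

class CrossModalComplement(nn.Module):
    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        # Cross-attention lets each modality query complementary cues
        # from the other modality's feature map.
        self.rgb_from_t = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.t_from_rgb = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.fuse = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, f_rgb: torch.Tensor, f_t: torch.Tensor) -> torch.Tensor:
        b, c, h, w = f_rgb.shape
        # Flatten spatial dimensions into token sequences: (B, H*W, C).
        rgb_seq = f_rgb.flatten(2).transpose(1, 2)
        t_seq = f_t.flatten(2).transpose(1, 2)
        # Each modality attends to the other for complementary context.
        rgb_enh, _ = self.rgb_from_t(rgb_seq, t_seq, t_seq)
        t_enh, _ = self.t_from_rgb(t_seq, rgb_seq, rgb_seq)
        # Restore the spatial layout and fuse the enhanced features.
        rgb_enh = rgb_enh.transpose(1, 2).reshape(b, c, h, w)
        t_enh = t_enh.transpose(1, 2).reshape(b, c, h, w)
        return self.fuse(torch.cat([rgb_enh, t_enh], dim=1))
```

As a usage example, passing two aligned feature maps of shape (B, C, H, W) from the RGB and thermal encoder streams returns a single fused map of the same shape, which could then feed a multi-scale fusion stage such as the MAFM described above.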
