Feature aggregation with transformer for RGB-T salient object detection

Ping Zhang, Mengnan Xu, Ziyan Zhang, Pan Gao, Jing Zhang

doi:10.1016/j.neucom.2023.126329

Abstract

The main purpose of RGB-T salient object detection (SOD) is to integrate and exploit the complementary information in the RGB and thermal modalities, addressing the underperformance of RGB-only SOD in some challenging scenes. In this paper, we propose a novel feature aggregation network that mines multi-scale and multi-modal information for complete and accurate RGB-T SOD. Specifically, a cross-attention fusion module adaptively integrates high-level features using the attention mechanism of the Transformer, and a simple yet effective fast feature aggregation module fuses low-level features. Working together, these modules allow our network to perform well in some complex scenes by effectively fusing features from the RGB and thermal modalities. Experiments on publicly available datasets such as VT821, VT1000, and VT5000 demonstrate that our method outperforms state-of-the-art methods. Our code has been released at: https://github.com/ELOESZHANG/FANet.
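To make the idea of cross-attention fusion concrete, the following is a minimal sketch of how features from two modalities could attend to each other with Transformer-style scaled dot-product attention. It is not the authors' module (see the released code for that): the function names, the single-head design, the random projection weights, the residual sum used for fusion, and the token and channel sizes are all assumptions chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the maximum along the axis for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(query_tokens, context_tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product attention.

    Queries come from one modality; keys and values come from the other.
    Returns the attended features and the attention weights.
    """
    q = query_tokens @ w_q                      # (N, d)
    k = context_tokens @ w_k                    # (M, d)
    v = context_tokens @ w_v                    # (M, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])     # (N, M)
    weights = softmax(scores, axis=-1)          # each row sums to 1
    return weights @ v, weights


def fuse_rgb_thermal(rgb_tokens, thermal_tokens, params):
    """Hypothetical fusion: each modality attends to the other, then sum."""
    rgb_att, _ = cross_attention(rgb_tokens, thermal_tokens, *params["rgb_from_t"])
    t_att, _ = cross_attention(thermal_tokens, rgb_tokens, *params["t_from_rgb"])
    # Residual connections keep each modality's own features in the result.
    return (rgb_tokens + rgb_att) + (thermal_tokens + t_att)


# Toy high-level features: 16 spatial tokens with 8 channels per modality.
rng = np.random.default_rng(0)
n_tokens, dim = 16, 8
rgb = rng.standard_normal((n_tokens, dim))
thermal = rng.standard_normal((n_tokens, dim))

# Random (untrained) projection matrices W_q, W_k, W_v for each direction.
params = {
    name: tuple(0.1 * rng.standard_normal((dim, dim)) for _ in range(3))
    for name in ("rgb_from_t", "t_from_rgb")
}

fused = fuse_rgb_thermal(rgb, thermal, params)
print(fused.shape)  # (16, 8)
```

In this sketch the attention weights decide, for every RGB token, how much each thermal token contributes (and vice versa), which is one way an attention mechanism can weight the two modalities adaptively per location.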

Journal: Neurocomputing
Publication Date: May 12, 2023
Citations: 6

Similar Papers

Estimating Obstacle Maps for USVs Based on a Multistage Feature Aggregation and Semantic Feature Separation Network
Jingyi Liu ... Hengyu Li
Journal of Intelligent & Robotic Systems | VOL. 102
24 Apr 2021

Quality-Aware Feature Aggregation Network for Robust RGBT Tracking
Yabin Zhu ... Bin Luo
IEEE Transactions on Intelligent Vehicles | VOL. 6
23 Mar 2020

Cross-Modality Double Bidirectional Interaction and Fusion Network for RGB-T Salient Object Detection
Zhengxuan Xie ... Feng Shao
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 33
01 Aug 2023

UTDNet: A unified triplet decoder network for multimodal salient object detection
Fushuo Huo ... Song Guo
Neural Networks | VOL. 170
24 Nov 2023
