CAE-Net: Cross-Modal Attention Enhancement Network for RGB-T Salient Object Detection

Chengtao Lv,Yaoqi Sun,Chenggang Yan,Bin Wan,Ji Hu,Jiyong Zhang,Xiaofei Zhou

doi:10.3390/electronics12040953

Abstract

RGB salient object detection (SOD) performs poorly in low-contrast and complex background scenes. Fortunately, the thermal infrared image can capture the heat distribution of scenes as complementary information to the RGB image, so the RGB-T SOD has recently attracted more and more attention. Many researchers have committed to accelerating the development of RGB-T SOD, but some problems still remain to be solved. For example, the defective sample and interfering information contained in the RGB or thermal image hinder the model from learning proper saliency features, meanwhile the low-level features with noisy information result in incomplete salient objects or false positive detection. To solve these problems, we design a cross-modal attention enhancement network (CAE-Net). First, we concretely design a cross-modal fusion (CMF) module to fuse cross-modal features, where the cross-attention unit (CAU) is employed to enhance the two modal features, and channel attention is used to dynamically weigh and fuse the two modal features. Then, we design the joint-modality decoder (JMD) to fuse cross-level features, where the low-level features are purified by higher level features, and multi-scale features are sufficiently integrated. Besides, we add two single-modality decoder (SMD) branches to preserve more modality-specific information. Finally, we employ a multi-stream fusion (MSF) module to fuse three decoders’ features. Comprehensive experiments are conducted on three RGB-T datasets, and the results show that our CAE-Net is comparable to the other methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Feb 14, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

CAE-Net: Cross-Modal Attention Enhancement Network for RGB-T Salient Object Detection

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Lightweight multi-level feature difference fusion network for RGB-D-T salient object detection
Kechen Song ... Yunhui Yan
Journal of King Saud University - Computer and Information Sciences | VOL. 35
Kechen Song, et. al.Kechen Song ... Yunhui Yan
09 Aug 2023
Journal of King Saud University - Computer and Information Sciences | VOL. 35

Salient Object Detection Using Recurrent Guidance Network With Hierarchical Attention Features
Shanmei Lu ... Yongxia Zhang
IEEE Access | VOL. 8
Shanmei Lu, et. al.Shanmei Lu ... Yongxia Zhang
01 Jan 2020
IEEE Access | VOL. 8

CCAFNet: Crossflow and Cross-Scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images
Wujie Zhou ... Lu Yu
IEEE Transactions on Multimedia | VOL. 24
Wujie Zhou, et. al.Wujie Zhou ... Lu Yu
01 Jan 2021
IEEE Transactions on Multimedia | VOL. 24

Transformer-based difference fusion network for RGB-D salient object detection
Zhi-Qiang Cui ... Feng Wang
Journal of Electronic Imaging | VOL. 31
Zhi-Qiang Cui, et. al.Zhi-Qiang Cui ... Feng Wang
27 Dec 2022
Journal of Electronic Imaging | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CAE-Net: Cross-Modal Attention Enhancement Network for RGB-T Salient Object Detection

Abstract

Talk to us

Similar Papers

More From: Electronics