Abstract
In recent years, bimodal salient object detection has developed rapidly. Owing to its robustness in extreme situations such as background similarity and illumination variation, researchers have begun to focus on RGB-Depth-Thermal salient object detection (RGB-D-T SOD). However, most existing bimodal methods require expensive computation to achieve accurate prediction, and this situation is even more serious for three-modal methods, which undoubtedly limits their applicability. To address this problem, we are the first to propose a lightweight multi-level feature difference fusion network (MFDF) for real-time RGB-D-T SOD. Since the depth modality contains less useful information, we design an asymmetric three-stream encoder based on MobileNetV2. Because high-level and low-level features differ in semantics and detail, applying the same module to both indiscriminately would introduce a large number of redundant parameters. Accordingly, in the encoding stage, we introduce a cross-modal enhancement module (CME) and a cross-modal fusion module (CMF) to fuse low-level and high-level features, respectively. To further reduce redundant parameters, we design a low-level feature decoding module (LFD) and a multi-scale high-level feature fusion module (MHFF). Extensive experiments demonstrate that the proposed MFDF outperforms 17 state-of-the-art methods. In terms of efficiency, MFDF runs faster (124 FPS at an input size of 320 × 320) and has far fewer parameters (8.9 M).
Journal of King Saud University - Computer and Information Sciences