Abstract

Multi-modal feature fusion and saliency reasoning are two core sub-tasks of RGB-D salient object detection. However, most existing models employ linear fusion strategies (e.g., concatenation) for multi-modal feature fusion and use a simple coarse-to-fine structure for saliency reasoning. Despite their simplicity, such designs can neither fully capture the cross-modal complementary information nor exploit the multi-level complementary information among the cross-modal features at different levels. To address these issues, a novel RGB-D salient object detection model is presented, in which special attention is paid to these two sub-tasks. Concretely, a multi-modal feature interaction module is first presented to explore richer interactions between the unimodal RGB and depth features. It helps to capture their cross-modal complementary information by jointly using simple linear fusion strategies and bilinear ones. Then, a saliency prior information guided fusion module is presented to exploit the multi-level complementary information among the fused cross-modal features at different levels. Instead of employing a simple convolutional layer for the final saliency prediction, a saliency refinement and prediction module is designed to better exploit the extracted multi-level cross-modal information for RGB-D saliency detection. Experimental results on several benchmark datasets verify the effectiveness and superiority of the proposed framework over state-of-the-art methods.
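To make the idea of jointly using linear and bilinear fusion concrete, the sketch below shows one plausible form such a multi-modal feature interaction module could take. It is not the authors' implementation: the class name, layer choices, and the use of an element-wise product as a low-cost stand-in for bilinear interaction are all assumptions for illustration only.

```python
# Minimal sketch (assumed, not the paper's code) of combining a linear fusion
# branch (concatenation) with a bilinear-style branch (element-wise product of
# projected features) for RGB and depth feature maps.
import torch
import torch.nn as nn


class MultiModalFeatureInteraction(nn.Module):  # hypothetical module name
    def __init__(self, channels: int):
        super().__init__()
        # Linear fusion branch: concatenate RGB and depth, reduce with 1x1 conv.
        self.linear_fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)
        # Bilinear-style branch: project each modality, then multiply element-wise.
        self.rgb_proj = nn.Conv2d(channels, channels, kernel_size=1)
        self.depth_proj = nn.Conv2d(channels, channels, kernel_size=1)
        # Merge the two kinds of interaction back to the original channel count.
        self.merge = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, rgb_feat: torch.Tensor, depth_feat: torch.Tensor) -> torch.Tensor:
        # Linear (concatenative) cross-modal interaction.
        linear = self.linear_fuse(torch.cat([rgb_feat, depth_feat], dim=1))
        # Multiplicative (bilinear-style) cross-modal interaction.
        bilinear = self.rgb_proj(rgb_feat) * self.depth_proj(depth_feat)
        # Jointly exploit both interactions.
        return self.merge(torch.cat([linear, bilinear], dim=1))


if __name__ == "__main__":
    module = MultiModalFeatureInteraction(channels=64)
    rgb = torch.randn(1, 64, 56, 56)
    depth = torch.randn(1, 64, 56, 56)
    print(module(rgb, depth).shape)  # torch.Size([1, 64, 56, 56])
```

The same pattern would typically be applied at several backbone levels, after which a cross-level fusion stage (the saliency prior guided fusion described above) aggregates the per-level outputs.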
