Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection

Guang Feng,Jinyu Meng,Lihe Zhang,Huchuan Lu

doi:10.1016/j.patcog.2022.108666

Abstract

Recently, RGB-D salient object detection (SOD) has aroused widespread research interest. Existing RGB-D SOD approaches mainly consider the cross-modal information fusion in the decoder. And their multi-modal interaction mainly concentrates on the same level of features between RGB stream and depth stream. They do not deeply explore the coherence of multi-model features at different levels. In this paper, we design a two-stream deep interleaved encoder network to extract RGB and depth information and realize their mixing simultaneously. This network allows us to gradually learn multi-modal representation at different levels from shallow to deep. Moreover, to further fuse multi-modal features in the decoding stage, we propose a cross-modal mutual guidance module and a residual multi-scale aggregation module to implement the global guidance and local refinement of the salient region. Extensive experiments on six benchmark datasets demonstrate that the proposed approach performs favorably against most state-of-the-art methods under different evaluation metrics. During the testing stage, this model can run at a real-time speed of 93 FPS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Mar 24, 2022
Citations: 25

Similar Papers

RGB-D Salient Object Detection via 3D Convolutional Neural Networks
Qian Chen ... Keren Fu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Qian Chen, et. al.Qian Chen ... Keren Fu
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

UTDNet: A unified triplet decoder network for multimodal salient object detection
Fushuo Huo ... Song Guo
Neural Networks | VOL. 170
Fushuo Huo, et. al.Fushuo Huo ... Song Guo
24 Nov 2023
Neural Networks | VOL. 170

ECW-EGNet: Exploring Cross-ModalWeighting and edge-guided decoder network for RGB-D salient object detection
Chenxing Xia ... Xianjin Fang
Computer Science and Information Systems | VOL. 21
Chenxing Xia, et. al.Chenxing Xia ... Xianjin Fang
01 Jan 2024
Computer Science and Information Systems | VOL. 21

3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond.
Qian Chen ... Qijun Zhao
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35
Qian Chen, et. al.Qian Chen ... Qijun Zhao
01 Mar 2024
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition