DMNet: Dynamic Memory Network for RGB-D Salient Object Detection

Haishun Du,Zhen Zhang,Minghao Zhang,Kangyi Qiao

doi:10.1016/j.dsp.2023.104221

Abstract

RGB-D salient object detection aims to identify the most salient object using the color and depth information of images. Currently, most of the existing salient object detection models use the classical U-Net architecture, which uses up-sampling and short link decoding to exploit saliency cues after generating multi-level features by successive convolution and pooling operations. However, the multi-level encoding and progressive up-sampling decoding used by these models may lose some of the high-level semantic information during feature extraction and information fusion and some of the detailed information during image recovery, which in turn affects the quality of generated saliency maps. To solve these problems, we propose a dynamic memory network (DMNet) consisting of an interactive enhancement encoder and a dynamic memory decoder, which achieves high-precision detection of salient objects with a more complete and superior encoding and decoding strategy. Specifically, in the interactive enhancement encoder, we propose a feature interactive fusion module (FIFM), which can enhance intermediate-scale RGB features by fusing RGB features of three adjacent scales. Moreover, we design a depth-guided dense fusion module (DDFM) to fuse the features from RGB and depth modalities. In the dynamic memory decoder, we design a full-dimensional dynamic convolutional expansion module (FDCEM) to enhance the information representation capability of features at different scales. Furthermore, we also design a gated decoding module (GDM) to decode features of different scales and eliminate non-salient information. Extensive experimental results on STERE, SIP, NLPR, NJU2K, SSD, and DES datasets demonstrate that our model outperforms most of the state-of-the-art RGB-D SOD methods. In addition, the experimental results on three RGB-T datasets, VT821, VT1000, and VT5000 also show that our model can be effectively used for RGB-T SOD.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DMNet: Dynamic Memory Network for RGB-D Salient Object Detection

Abstract

Talk to us

Similar Papers

More From: Digital Signal Processing

Lead the way for us

Similar Papers

CCAFNet: Crossflow and Cross-Scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images
Wujie Zhou ... Lu Yu
IEEE Transactions on Multimedia | VOL. 24
Wujie Zhou, et. al.Wujie Zhou ... Lu Yu
01 Jan 2021
IEEE Transactions on Multimedia | VOL. 24

Three‐stream RGB‐D salient object detection network based on cross‐level and cross‐modal dual‐attention fusion
Lingbing Meng ... Qingqing Liu
IET Image Processing | VOL. 17
Lingbing Meng, et. al.Lingbing Meng ... Qingqing Liu
03 Jul 2023
IET Image Processing | VOL. 17

WGI-Net: A weighted group integration network for RGB-D salient object detection
Yanliang Ge ... Cong Zhang
Computational Visual Media | VOL. 7
Yanliang Ge, et. al.Yanliang Ge ... Cong Zhang
08 Jan 2021
Computational Visual Media | VOL. 7

Finding spatio-temporal salient paths for video objects discovery
Ye Luo ... Jianwei Lu
Journal of Visual Communication and Image Representation | VOL. 38
Ye Luo, et. al.Ye Luo ... Jianwei Lu
20 Feb 2016
Journal of Visual Communication and Image Representation | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DMNet: Dynamic Memory Network for RGB-D Salient Object Detection

Abstract

Talk to us

Similar Papers

More From: Digital Signal Processing