Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection

Ruimin Wang, Fuming Sun, Fasheng Wang, Yiming Su, Haojie Li, Jing Sun

doi:10.1145/3624747

Open Access

https://doi.org/10.1145/3624747

Abstract

The past decade has witnessed great progress in RGB-D salient object detection (SOD). However, two bottlenecks limit its further development. The first is low-quality depth maps: most existing methods feed raw depth maps directly into detection, but low-quality depth images degrade detection performance, so depth maps should not be used indiscriminately. The second is how to effectively predict saliency maps with clear boundaries and complete salient regions. To address these problems, an Attention-Guided Multi-Modality Interaction Network (AMINet) is proposed. First, we propose a new quality enhancement strategy for unreliable depth images, named the Depth Enhancement Module (DEM). For the second issue, we propose the Cross-Modality Attention Module (CMAM) to rapidly locate salient regions. The Boundary-Aware Module (BAM) is designed to use high-level features to guide low-level feature generation in a top-down manner, making up for the dilution of the boundary. To further improve accuracy, we propose the Atrous Refined Block (ARB) to adaptively compensate for the shortcoming of atrous convolution. By integrating these interactive modules, features from the depth and RGB streams can be refined efficiently, which in turn boosts detection performance. Experimental results demonstrate that the proposed AMINet exceeds state-of-the-art (SOTA) methods on several public RGB-D datasets.
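The abstract does not specify which shortcoming of atrous convolution the ARB targets, nor how the block is built. As background only, the following is a minimal sketch of plain atrous (dilated) convolution in one dimension, not the authors' ARB. The function names `atrous_conv1d` and `taps` are hypothetical. A commonly cited drawback of atrous convolution is that, for rates above 1, each output reads only every `rate`-th input and skips the samples in between; whether this is the shortcoming the ARB addresses is an assumption.

```python
from typing import List, Sequence


def atrous_conv1d(signal: Sequence[float], kernel: Sequence[float], rate: int) -> List[float]:
    """'Valid' 1-D atrous convolution (cross-correlation form, no kernel flip).

    Kernel taps are spaced `rate` samples apart, so a kernel of length k
    spans (k - 1) * rate + 1 input samples without adding any weights.
    """
    if rate < 1:
        raise ValueError("rate must be >= 1")
    span = (len(kernel) - 1) * rate + 1
    out = []
    for start in range(len(signal) - span + 1):
        # Read one input per kernel weight, stepping by `rate`.
        out.append(sum(w * signal[start + j * rate] for j, w in enumerate(kernel)))
    return out


def taps(kernel_size: int, rate: int) -> List[int]:
    """Input offsets read by a single output position."""
    return [j * rate for j in range(kernel_size)]


signal = [float(i) for i in range(10)]
kernel = [1.0, 1.0, 1.0]

dense = atrous_conv1d(signal, kernel, rate=1)    # contiguous taps: offsets 0, 1, 2
dilated = atrous_conv1d(signal, kernel, rate=3)  # sparse taps: offsets 0, 3, 6

print(taps(3, 1), dense)
print(taps(3, 3), dilated)
```

With `rate=3` the same three weights cover a span of seven samples, which enlarges the receptive field at no extra parameter cost, but offsets 1, 2, 4 and 5 are never read by that output.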

Journal: ACM Transactions on Multimedia Computing, Communications, and Applications
Publication Date: Oct 23, 2023
Citations: 12

Similar Papers

DAST: Depth-Aware Assessment and Synthesis Transformer for RGB-D Salient Object Detection
Chenxing Xia ... Songsong Duan
01 Jan 2021

CDNet: Complementary Depth Network for RGB-D Salient Object Detection
Wen-Da Jin ... Ming-Ming Cheng
IEEE Transactions on Image Processing | VOL. 30
01 Jan 2020

Depth cue enhancement and guidance network for RGB-D salient object detection
Xiang Li ... Meng Dai
Journal of Visual Communication and Image Representation | VOL. 95
20 Jun 2023

Depth-aware salient object segmentation
Le Vu Ha ... Tran Hoang Tung
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36
07 Oct 2020
