GroupTransNet: Group transformer network for RGB-D salient object detection

Xian Fang, Mingfeng Jiang, Jinchao Zhu, Xiuli Shao, Hongpeng Wang

doi:10.1016/j.neucom.2024.127865

Abstract

As an active topic in computer vision, RGB-D salient object detection has witnessed substantial progress. Although existing methods have achieved appreciable performance, some challenges remain. Because convolutional neural networks are inherently local, a model must be sufficiently deep to obtain a global receptive field, whereas transformers, despite their strong global modeling, often represent local characteristics inadequately. In addition, the information shared among contextual features is often overlooked. To address these bottlenecks, we propose a novel group transformer network (GroupTransNet), which learns the long-range dependencies of cross-layer features to improve feature representation across high-level and low-level features. Importantly, we soft-group the features of the middle three and last three levels so that they absorb the semantic information of features from slightly earlier levels. First, the input features are adaptively purified by an element-wise operation and a sequential attention mechanism. Next, the intermediate features are uniformly fused at different layers and then processed by several transformers in multiple groups. Finally, the output features are clustered within different classifications and combined with the underlying features. Extensive experiments demonstrate that the proposed GroupTransNet outperforms competing methods and achieves new state-of-the-art performance.
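
The grouping step described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: it assumes five feature levels already resized to a common token shape, reads "soft grouping" as overlapping groups (the middle three and the last three levels), approximates "uniform fusion" by concatenating tokens, and stands in for each transformer with a single-head self-attention that uses identity projections. The names `self_attention`, `soft_group`, `levels`, and `groups` are hypothetical.

```python
import numpy as np

def self_attention(tokens):
    """Single-head scaled dot-product self-attention (identity projections, an assumption)."""
    d = tokens.shape[-1]
    scores = tokens @ tokens.T / np.sqrt(d)
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ tokens, weights

def soft_group(levels, groups):
    """Concatenate the tokens of the levels in each group; groups may overlap."""
    return [np.concatenate([levels[i] for i in idx], axis=0) for idx in groups]

rng = np.random.default_rng(0)
num_tokens, channels = 16, 8  # illustrative sizes, not taken from the paper

# Five hypothetical feature levels, each a (tokens, channels) matrix.
levels = [rng.standard_normal((num_tokens, channels)) for _ in range(5)]

# Assumed grouping: middle three levels and last three levels (0-indexed).
groups = [(1, 2, 3), (2, 3, 4)]

grouped = soft_group(levels, groups)
outputs = [self_attention(g)[0] for g in grouped]
```

Because the two groups share levels 2 and 3, each attention pass sees tokens from neighbouring levels, which is one plausible reading of how later levels could absorb information from slightly earlier ones.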

Journal: Neurocomputing
Publication Date: May 17, 2024
Citations: 2
