HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection

Bin Tang,Yacheng Tan,Zhengyi Liu,Qian He

doi:10.1109/tcsvt.2022.3202563

Abstract

The High-Resolution Transformer (HRFormer) can maintain high-resolution representation and share global receptive fields. It is friendly towards salient object detection (SOD) in which the input and output have the same resolution. However, two critical problems need to be solved for two-modality SOD. One problem is two-modality fusion. The other problem is the HRFormer output's fusion. To address the first problem, a supplementary modality is injected into the primary modality by using global optimization and an attention mechanism to select and purify the modality at the input level. To solve the second problem, a dual-direction short connection fusion module is used to optimize the output features of HRFormer, thereby enhancing the detailed representation of objects at the output level. The proposed model, named HRTransNet, first introduces an auxiliary stream for feature extraction of supplementary modality. Then, features are injected into the primary modality at the beginning of each multi-resolution branch. Next, HRFormer is applied to achieve forwarding propagation. Finally, all the output features with different resolutions are aggregated by intra-feature and inter-feature interactive transformers. Application of the proposed model results in impressive improvement for driving two-modality SOD tasks, e.g., RGB-D, RGB-T, and light field SOD.https://github.com/liuzywen/HRTransNet

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Feb 1, 2023
Citations: 30

Similar Papers

Light field salient object detection: A review and benchmark
Keren Fu ... Ge-Peng Ji
Computational Visual Media | VOL. 8
Keren Fu, et. al.Keren Fu ... Ge-Peng Ji
16 May 2022
Computational Visual Media | VOL. 8

3D salient object detection based on light field integral imaging.
Yu Kou ... Ying Li
Optics Letters | VOL. 48
Yu Kou, et. al.Yu Kou ... Ying Li
21 Sep 2023
Optics Letters | VOL. 48

Salient object detection with high‐level prior based on Bayesian fusion
Anzhi Wang ... Xiaoyan Yuan
IET Computer Vision | VOL. 11
Anzhi Wang, et. al.Anzhi Wang ... Xiaoyan Yuan
28 Feb 2017
IET Computer Vision | VOL. 11

A Multi-Task Collaborative Network for Light Field Salient Object Detection
Qiudan Zhang ... Xu Wang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 31
Qiudan Zhang, et. al.Qiudan Zhang ... Xu Wang
07 Aug 2020
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology