Abstract

RGB-D scene recognition has achieved promising performance because depth provides geometric information complementary to RGB images. However, the limited availability of depth sensors severely restricts RGB-D applications. In this paper, we focus on the depth-privileged setting, in which depth information is available during training but not during testing. Since the information in RGB and depth images is complementary, and attention is both informative and transferable, our idea is to use the RGB input to hallucinate depth attention. We build our model upon the modulated deformable convolutional layer and hallucinate dual attention: post-hoc importance weights and trainable spatial transformations. Specifically, we use the modulation (resp., offset) learned from RGB to mimic the Grad-CAM (resp., offset) learned from depth, combining the strengths of both forms of attention. We also design a weighted loss that, according to the quality of the depth attention, down-weights the transfer term to avoid negative transfer. Extensive experiments on two benchmarks, i.e., SUN RGB-D and NYUDv2, demonstrate that our method outperforms state-of-the-art methods for depth-privileged scene recognition.
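
The abstract does not spell out the implementation, but the core mechanism can be illustrated with a minimal sketch, assuming PyTorch and torchvision's modulated deformable convolution (deform_conv2d with a mask argument). All names here (RGBHallucinationBlock, hallucination_loss, the per-sample quality weight w) are hypothetical and not taken from the paper; this shows only the general pattern of predicting offset and modulation from RGB features and matching them to depth-derived targets.

```python
# Minimal sketch of dual-attention hallucination, assuming PyTorch/torchvision.
# Module and function names are hypothetical illustrations, not the paper's code.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.ops import deform_conv2d


class RGBHallucinationBlock(nn.Module):
    """Modulated deformable conv whose offset (spatial transformation) and
    modulation (importance weights) are predicted from RGB features."""

    def __init__(self, in_ch, out_ch, k=3):
        super().__init__()
        self.k = k
        self.weight = nn.Parameter(torch.randn(out_ch, in_ch, k, k) * 0.01)
        # Offset head: 2 coordinates (dx, dy) per kernel sampling location.
        self.offset_head = nn.Conv2d(in_ch, 2 * k * k, 3, padding=1)
        # Modulation head: one importance weight per kernel sampling location.
        self.mod_head = nn.Conv2d(in_ch, k * k, 3, padding=1)

    def forward(self, x):
        offset = self.offset_head(x)            # (N, 2*k*k, H, W)
        mod = torch.sigmoid(self.mod_head(x))   # (N, k*k, H, W), in [0, 1]
        out = deform_conv2d(x, offset, self.weight,
                            padding=self.k // 2, mask=mod)
        return out, offset, mod


def hallucination_loss(offset_rgb, mod_rgb, offset_depth, gradcam_depth, w):
    """Train RGB-predicted attention to mimic depth attention.

    gradcam_depth: (N, 1, H, W) Grad-CAM map from the depth network, broadcast
    against the RGB modulation; offset_depth: offsets from the depth branch.
    w: per-sample weight in [0, 1] reflecting depth-attention quality, so that
    unreliable depth attention is down-weighted to avoid negative transfer.
    """
    l_mod = F.mse_loss(mod_rgb, gradcam_depth.expand_as(mod_rgb),
                       reduction="none").mean(dim=(1, 2, 3))
    l_off = F.mse_loss(offset_rgb, offset_depth,
                       reduction="none").mean(dim=(1, 2, 3))
    return (w * (l_mod + l_off)).mean()
```

At test time only the RGB branch and its predicted offset/modulation are used, so no depth input is required, which is what makes the setting depth-privileged rather than RGB-D.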
