Abstract

This paper focuses on the task of RGB-D indoor scene classification. The task is challenging for two reasons: 1) learning robust representations for indoor scenes is difficult because of their diverse objects and layouts; 2) fusing the complementary cues in the RGB and depth modalities is nontrivial because of the large semantic gap between them. Most existing works learn representations for classification by training a deep network with a softmax loss and fuse the two modalities by simply concatenating their features. However, these pipelines do not explicitly model intra-class and inter-class similarities or the intrinsic relationships between modalities. To address these problems, this paper proposes a Discriminative Feature Learning and Fusion Network (DF2Net) with two-stage training. In the first stage, to better represent scenes in each modality, a deep multi-task network is constructed to simultaneously minimize the structured loss and the softmax loss. In the second stage, we design a novel discriminative fusion network that learns features correlated across the two modalities as well as features distinctive to each modality. Extensive analysis and experiments on the SUN RGB-D Dataset and NYU Depth Dataset V2 show the superiority of DF2Net over other state-of-the-art methods on the RGB-D indoor scene classification task.
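To make the two-stage idea concrete, below is a minimal PyTorch sketch. It is not the authors' implementation: the triplet loss stands in for the paper's structured loss, the cosine-similarity correlation term is a hypothetical simplification of the correlative-feature objective, and names such as MultiTaskHead, fusion_loss, and lam are illustrative.

```python
# Minimal sketch of the two training stages described in the abstract.
# Assumptions: triplet loss approximates the structured loss (stage 1),
# and a negative cosine-similarity term approximates the correlative
# objective between modalities (stage 2).
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiTaskHead(nn.Module):
    """Stage 1: per-modality head jointly minimizing a softmax
    (cross-entropy) loss and a structured loss (triplet stand-in)."""

    def __init__(self, feat_dim: int, num_classes: int, margin: float = 0.2):
        super().__init__()
        self.classifier = nn.Linear(feat_dim, num_classes)
        self.triplet = nn.TripletMarginLoss(margin=margin)

    def forward(self, anchor, positive, negative, labels):
        # Softmax loss: push the anchor embedding toward its class.
        ce = F.cross_entropy(self.classifier(anchor), labels)
        # Structured loss: pull same-class pairs together and push
        # different-class pairs apart in the embedding space.
        tri = self.triplet(anchor, positive, negative)
        return ce + tri


def fusion_loss(rgb_feat, depth_feat, fused_logits, labels, lam: float = 0.1):
    """Stage 2: discriminative fusion. A cross-entropy term keeps the
    fused representation discriminative, while a correlation term
    (hypothetical simplification) aligns RGB and depth features."""
    ce = F.cross_entropy(fused_logits, labels)
    corr = 1.0 - F.cosine_similarity(rgb_feat, depth_feat, dim=1).mean()
    return ce + lam * corr
```

In this reading, stage 1 shapes each modality's embedding space with intra-class and inter-class constraints before fusion, and stage 2 balances a shared (correlated) component against the per-modality discriminative signal via the weight lam.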
