Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs

Xinhang Song,Luis Herranz,Shuqiang Jiang

doi:10.1609/aaai.v31i1.11226

Abstract

Scene recognition with RGB images has been extensively studied and has reached very remarkable recognition levels, thanks to convolutional neural networks (CNN) and large scene datasets. In contrast, current RGB-D scene data is much more limited, so often leverages RGB large datasets, by transferring pretrained RGB CNN models and fine-tuning with the target RGB-D dataset. However, we show that this approach has the limitation of hardly reaching bottom layers, which is key to learn modality-specific features. In contrast, we focus on the bottom layers, and propose an alternative strategy to learn depth features combining local weakly supervised training from patches followed by global fine tuning with images. This strategy is capable of learning very discriminative depth-specific features with limited depth images, without resorting to Places-CNN. In addition we propose a modified CNN architecture to further match the complexity of the model and the amount of data available. For RGB-D scene recognition, depth and RGB features are combined by projecting them in a common space and further leaning a multilayer classifier, which is jointly optimized in an end-to-end network. Our framework achieves state-of-the-art accuracy on NYU2 and SUN RGB-D in both depth only and combined RGB-D data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Feb 12, 2017
Citations: 31

Similar Papers

When CNNs meet random RNNs: Towards multi-level analysis for RGB-D object and scene recognition
Ali Caglayan ... Ryosuke Nakamura
Computer Vision and Image Understanding | VOL. 217
Ali Caglayan, et. al.Ali Caglayan ... Ryosuke Nakamura
21 Jan 2022
Computer Vision and Image Understanding | VOL. 217

Performance Improvement Of Pre-trained Convolutional Neural Networks For Action Recognition
Tayyip Ozcan ... Alper Basturk
The Computer Journal | VOL. 64
Tayyip Ozcan, et. al.Tayyip Ozcan ... Alper Basturk
15 Jun 2020
The Computer Journal | VOL. 64

Convolutional neural network for sapphire ingots defect detection and classification
Euphrem Mugisha Rwagasore ... Mengtong Wang
Optical Materials | VOL. 119
Euphrem Mugisha Rwagasore, et. al.Euphrem Mugisha Rwagasore ... Mengtong Wang
01 Sep 2021
Optical Materials | VOL. 119

An automated diagnosis and classification of COVID-19 from chest CT images using a transfer learning-based convolutional neural network
Nadiah A Baghdadi ... Mostafa Elhosseini
Computers in Biology and Medicine | VOL. 144
Nadiah A Baghdadi, et. al.Nadiah A Baghdadi ... Mostafa Elhosseini
10 Mar 2022
Computers in Biology and Medicine | VOL. 144

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence