Abstract

Most past deep learning methods proposed for RGB-D scene classification rely on global information, directly processing all pixels of the whole image for high-level tasks. Such methods capture little information about local feature distributions, and they simply concatenate RGB and depth features without exploring the correlation and complementarity between the raw RGB and depth images. From a human vision perspective, we recognize the category of an unknown scene mainly from object-level information, including appearance, texture, shape, and depth, and we also take the structural distribution of the different objects into account. Based on this observation, constructing mid-level representations from discriminative object parts is generally more attractive for scene analysis. In this paper, we propose a new Convolutional Neural Network (CNN)-based local multi-modal feature learning framework (LM-CNN) for RGB-D scene classification. The method effectively captures local structure in RGB-D scene images and automatically learns a fusion strategy for the object-level recognition step instead of simply training a classifier on top of features extracted from both modalities. Experimental results on two popular datasets, the NYU v1 depth dataset and the SUN RGB-D dataset, show that our method with local multi-modal CNNs outperforms state-of-the-art methods.
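To make the contrast between naive feature concatenation and a learned fusion strategy concrete, the sketch below shows a minimal two-branch RGB-D network in PyTorch in which a trainable fusion layer mixes the modality features before classification. This is an illustrative assumption-based example, not the paper's actual LM-CNN architecture; the layer sizes, the `TwoBranchFusionNet` class, and all hyperparameters are hypothetical.

```python
import torch
import torch.nn as nn

class TwoBranchFusionNet(nn.Module):
    """Hypothetical sketch: separate RGB and depth encoders whose features are
    combined by a learned fusion layer rather than used as a raw concatenation.
    Not the paper's exact LM-CNN."""

    def __init__(self, num_classes=10, feat_dim=128):
        super().__init__()

        def encoder(in_ch):
            # Small convolutional encoder, same structure for both modalities.
            return nn.Sequential(
                nn.Conv2d(in_ch, 32, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                nn.Linear(64, feat_dim), nn.ReLU(),
            )

        self.rgb_encoder = encoder(3)    # RGB branch (3 channels)
        self.depth_encoder = encoder(1)  # depth branch (1 channel)
        # Learned fusion: a trainable layer mixes the two modalities instead of
        # handing the classifier a plain concatenation of the two feature vectors.
        self.fusion = nn.Sequential(nn.Linear(2 * feat_dim, feat_dim), nn.ReLU())
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, rgb, depth):
        f_rgb = self.rgb_encoder(rgb)
        f_depth = self.depth_encoder(depth)
        fused = self.fusion(torch.cat([f_rgb, f_depth], dim=1))
        return self.classifier(fused)

# Example usage with random tensors standing in for RGB-D image patches.
model = TwoBranchFusionNet(num_classes=19)
rgb = torch.randn(4, 3, 64, 64)
depth = torch.randn(4, 1, 64, 64)
logits = model(rgb, depth)
print(logits.shape)  # torch.Size([4, 19])
```

In this sketch the fusion weights are learned jointly with both encoders, so the network can weight and combine the modalities end to end, which is the general idea behind learning a fusion strategy rather than fixing it by concatenation.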
