Abstract

Depth-modal information has recently proven effective in the computer vision community, especially for scene analysis tasks. However, its use still suffers from the scarcity of depth data and from the mismatch introduced when pre-trained RGB models are transferred directly to depth-modal data. In this study, we propose a novel two-step training strategy to address these problems, focusing on enhancing recognition performance on depth images in the RGB-D scene recognition task. Specifically, we build an effective “Res-U” architecture for a GAN (generative adversarial network) based RGB-to-depth modality translation model, endowed with both short and long skip connections for residual learning. On one hand, this lets us pre-train a depth-specific discriminator network from scratch in an unsupervised manner and then transfer it to the subsequent recognition task, instead of directly fine-tuning a pre-trained RGB model into a depth-specific one. On the other hand, new depth images with helpful perturbations, generated by the modality translation model, augment the original training set and regularize the learning process. This two-step training strategy makes it easier to train a modality-specific network to discriminate depth scenes. In addition, we extensively analyze the modality translation network to investigate its effect on recognizing depth-modal scenes, which suggests a reasonable way to take full advantage of multiple modalities. The proposed method achieves state-of-the-art accuracy on the NYU Depth v2 and SUN RGB-D benchmark datasets, especially when evaluated on depth data alone.
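
To make the “Res-U” idea concrete, below is a minimal PyTorch sketch of a generator that combines short skips (identity paths inside residual blocks) with long skips (encoder features added back to decoder features), mapping an RGB image to a single-channel depth prediction. The layer sizes, depth of the network, and the additive fusion of the long skip are illustrative assumptions, not the authors' published configuration.

```python
# Sketch of a "Res-U" style translation generator: short skips via residual
# blocks, long skips from encoder to decoder. Shapes and channel counts are
# hypothetical and chosen only to keep the example small.
import torch
import torch.nn as nn


class ResBlock(nn.Module):
    """Residual block: the identity path is the short skip."""

    def __init__(self, channels):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        return torch.relu(x + self.body(x))  # short skip


class ResUGenerator(nn.Module):
    """Encoder-decoder with residual blocks and a U-Net style long skip."""

    def __init__(self, in_ch=3, out_ch=1, base=64):
        super().__init__()
        self.enc1 = nn.Sequential(nn.Conv2d(in_ch, base, 3, padding=1), ResBlock(base))
        self.down = nn.Conv2d(base, base * 2, 4, stride=2, padding=1)
        self.enc2 = ResBlock(base * 2)
        self.up = nn.ConvTranspose2d(base * 2, base, 4, stride=2, padding=1)
        self.dec1 = ResBlock(base)
        self.out = nn.Conv2d(base, out_ch, 3, padding=1)

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.down(e1))
        d1 = self.dec1(self.up(e2) + e1)  # long skip: encoder features reused in decoder
        return torch.tanh(self.out(d1))   # predicted depth-modal image


# Usage: translate a batch of RGB images into depth predictions.
rgb = torch.randn(2, 3, 224, 224)
depth_pred = ResUGenerator()(rgb)
print(depth_pred.shape)  # torch.Size([2, 1, 224, 224])
```

In the two-step strategy described above, a discriminator trained adversarially against such a generator would then be reused as the starting point for the depth scene classifier, while the generator's outputs serve as perturbed training samples; the sketch covers only the generator side.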
