ACM: Adaptive Cross-Modal Graph Convolutional Neural Networks for RGB-D Scene Recognition

Yuan Yuan,Qi Wang,Zhitong Xiong

doi:10.1609/aaai.v33i01.33019176

Abstract

RGB image classification has achieved significant performance improvement with the resurge of deep convolutional neural networks. However, mono-modal deep models for RGB image still have several limitations when applied to RGB-D scene recognition. 1) Images for scene classification usually contain more than one typical object with flexible spatial distribution, so the object-level local features should also be considered in addition to global scene representation. 2) Multi-modal features in RGB-D scene classification are still under-utilized. Simply combining these modal-specific features suffers from the semantic gaps between different modalities. 3) Most existing methods neglect the complex relationships among multiple modality features. Considering these limitations, this paper proposes an adaptive crossmodal (ACM) feature learning framework based on graph convolutional neural networks for RGB-D scene recognition. In order to make better use of the modal-specific cues, this approach mines the intra-modality relationships among the selected local features from one modality. To leverage the multi-modal knowledge more effectively, the proposed approach models the inter-modality relationships between two modalities through the cross-modal graph (CMG). We evaluate the proposed method on two public RGB-D scene classification datasets: SUN-RGBD and NYUD V2, and the proposed method achieves state-of-the-art performance.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ACM: Adaptive Cross-Modal Graph Convolutional Neural Networks for RGB-D Scene Recognition

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jul 17, 2019
Citations: 28

Similar Papers

RGB-D Scene Recognition via Spatial-Related Multi-Modal Feature Learning
Zhitong Xiong ... Qi Wang
IEEE access : practical innovations, open solutions | VOL. 7
Zhitong Xiong, et. al.Zhitong Xiong ... Qi Wang
01 Jan 2019
IEEE access : practical innovations, open solutions | VOL. 7

ASK: Adaptively Selecting Key Local Features for RGB-D Scene Recognition.
Zhitong Xiong ... Yuan Yuan
IEEE Transactions on Image Processing | VOL. 30
Zhitong Xiong, et. al.Zhitong Xiong ... Yuan Yuan
01 Jan 2020
IEEE Transactions on Image Processing | VOL. 30

Incorporating Adaptive Sparse Graph Convolutional Neural Networks for Segmentation of Organs at Risk in Radiotherapy
Junjie Hu ... Alexander Hošovský
International Journal of Intelligent Systems | VOL. 2024
Junjie Hu, et. al.Junjie Hu ... Alexander Hošovský
26 Apr 2024
International Journal of Intelligent Systems | VOL. 2024

Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition
Hongyuan Zhu ... Jean-Baptiste Weibel
-
Hongyuan Zhu, et. al.Hongyuan Zhu ... Jean-Baptiste Weibel
01 Jun 2016
01 Jun 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ACM: Adaptive Cross-Modal Graph Convolutional Neural Networks for RGB-D Scene Recognition

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence