Semantic-Context Graph Network for Point-Based 3D Object Detection

Shuwei Dong,Fan Tang,Yi Chang,Xiaoyu Kong,Wei Li,Weiming Dong,Xingjia Pan

doi:10.1109/tcsvt.2023.3271318

Abstract

Point-based indoor 3D object detection has received increasing attention with the large demand for augmented reality, autonomous driving, and robot technology in the industry. However, the detection precision suffers from inputs with semantic ambiguity, i.e., shape symmetries, occlusion, and texture missing, which would lead that different objects appearing similar from different viewpoints and then confusing the detection model. Typical point-based detectors relieve this problem via learning proposal representations with both geometric and semantic information, while the entangled representation may cause a reduction in both semantic and spatial discrimination. In this paper, we focus on alleviating the confusion from entanglement and then enhancing the proposal representation by considering the proposal’s semantics and the context in one scene. A semantic-context graph network (SCGNet) is proposed, which mainly includes two modules: a category-aware proposal recoding module (CAPR) and a proposal context aggregation module (PCAg). To produce semantically clear features from entanglement representation, the CAPR module learns a high-level semantic embedding for each category to extract discriminative semantic clues. In view of further enhancing the proposal representation and leveraging the semantic clues, the PCAg module builds a graph to mine the most relevant context in the scene. With few bells and whistles, the SCGNet achieves SOTA performance and obtains consistent gains when applying to different backbones (0.9% ~ 2.4% on ScanNet V2 and 1.6% ~ 2.2% on SUN RGB-D for mAP@0.25). Code is available at https://github.com/dsw-jlu-rgzn/SCGNet.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Semantic-Context Graph Network for Point-Based 3D Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Nov 1, 2023
Citations: 2

Similar Papers

An Appearance-Semantic Descriptor with Coarse-to-Fine Matching for Robust VPR.
Jie Chen ... Pengshuai Hou
Sensors | VOL. 24
Jie Chen, et. al.Jie Chen ... Pengshuai Hou
29 Mar 2024
Sensors | VOL. 24

Is syntactic-category processing obligatory in visual word recognition? Evidence from Chinese
Andus Wing-Kuen Wong ... Hsuan-Chih Chen
Language and Cognitive Processes | VOL. 27
Andus Wing-Kuen Wong, et. al.Andus Wing-Kuen Wong ... Hsuan-Chih Chen
01 Nov 2012
Language and Cognitive Processes | VOL. 27

An emotion recognition mechanism based on the combination of mutual information and semantic clues
Hao-Chiang Koong Lin ... Min-Chai Hsieh
Journal of Ambient Intelligence and Humanized Computing | VOL. 3
Hao-Chiang Koong Lin, et. al.Hao-Chiang Koong Lin ... Min-Chai Hsieh
27 Oct 2011
Journal of Ambient Intelligence and Humanized Computing | VOL. 3

An investigation into the applicability of building information models in geospatial environment in support of site selection and fire response management processes
Umit Isikdag ... Ghassan Aouad
Advanced Engineering Informatics | VOL. 22
Umit Isikdag, et. al.Umit Isikdag ... Ghassan Aouad
21 Jul 2008
Advanced Engineering Informatics | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semantic-Context Graph Network for Point-Based 3D Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology