Abstract

To autonomously move and manipulate objects in cluttered indoor environments, a service robot requires 3D scene perception. Although 3D object detection can provide an object-level environmental description to fill this gap, a robot running detection continuously in a cluttered room frequently encounters incomplete object observations, repeated detections of the same object, detection errors, and intersecting detections. To solve these problems, we propose a two-stage 3D object detection algorithm: the first stage fuses multiple views of 3D object point clouds, and the second stage eliminates unreasonable and intersecting detections. For each view, the robot performs 2D object semantic segmentation and obtains 3D object point clouds. An unsupervised segmentation method, Locally Convex Connected Patches (LCCP), is then used to separate the object accurately from the background. Next, Manhattan Frame estimation computes the main orientation of the object, from which the 3D object bounding box is obtained. To handle objects detected across multiple views, we construct an object database and propose an object fusion criterion to maintain it automatically, so that the same object observed in multiple views is fused and a more accurate bounding box can be calculated. Finally, we propose an object filtering approach based on prior knowledge to remove incorrect and intersecting objects from the object database. Experiments on both the SceneNN dataset and a real indoor environment verify the stability and accuracy of object-level 3D semantic segmentation and bounding box detection with multi-view fusion.
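As a rough illustration of the per-view bounding-box step described above, the sketch below fits an oriented 3D box to an object point cloud. PCA is used here as a simple stand-in for the paper's Manhattan Frame estimation (both recover a dominant orthogonal orientation for the object); the function names and all details are our assumptions, not the authors' implementation.

```python
import numpy as np

def oriented_bounding_box(points):
    """Fit an oriented 3D bounding box to an (N, 3) object point cloud.

    PCA is a simplified stand-in for Manhattan Frame estimation: both
    recover a dominant orthogonal orientation for the object.
    Returns (center, 3x3 rotation, extents) in the input frame.
    """
    centroid = points.mean(axis=0)
    # Principal axes of the centered cloud give the box orientation.
    _, _, vt = np.linalg.svd(points - centroid, full_matrices=False)
    axes = vt.T                              # columns are the box axes
    local = (points - centroid) @ axes       # cloud in box coordinates
    lo, hi = local.min(axis=0), local.max(axis=0)
    center = centroid + axes @ ((lo + hi) / 2.0)
    extents = hi - lo                        # box side lengths
    return center, axes, extents

# Usage: a flat, box-like synthetic cloud
rng = np.random.default_rng(0)
cloud = rng.uniform(-1.0, 1.0, (500, 3)) * np.array([0.4, 0.2, 0.1])
center, axes, extents = oriented_bounding_box(cloud)
```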

Highlights

  • In an indoor environment, objects are the main contents and provide crucial clues for scene understanding and environmental perception

  • We propose a two-stage 3D object detection framework that fuses multiple views of 3D point clouds based on real-time visual SLAM for an indoor service robot

  • We propose an object filtering approach based on prior knowledge, including object size and volume ratio, to remove atypical and intersecting objects from the object database (a sketch of the volume-ratio test appears below)
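The intersection filter in the last highlight can be illustrated with a short, self-contained sketch. Everything below is our own illustration, not the authors' code: boxes are simplified to axis-aligned (min, max) corner pairs, and the 0.5 threshold is an arbitrary placeholder for whichever volume-ratio bound the paper uses.

```python
import numpy as np

def box_volume(box):
    """Volume of an axis-aligned box given as a (min_xyz, max_xyz) pair."""
    lo, hi = box
    return float(np.prod(np.clip(hi - lo, 0.0, None)))

def overlap_volume(a, b):
    """Overlap volume of two axis-aligned boxes."""
    lo = np.maximum(a[0], b[0])
    hi = np.minimum(a[1], b[1])
    return float(np.prod(np.clip(hi - lo, 0.0, None)))

def filter_intersections(boxes, ratio_threshold=0.5):
    """Drop the smaller box whenever the overlap covers more than
    ratio_threshold of its volume (the threshold value is illustrative)."""
    keep = [True] * len(boxes)
    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if not (keep[i] and keep[j]):
                continue
            small = i if box_volume(boxes[i]) <= box_volume(boxes[j]) else j
            ratio = overlap_volume(boxes[i], boxes[j]) / max(box_volume(boxes[small]), 1e-9)
            if ratio > ratio_threshold:
                keep[small] = False   # the smaller box is likely spurious
    return [b for b, k in zip(boxes, keep) if k]

# Usage: box b sits entirely inside box a, so it is removed.
a = (np.array([0.0, 0.0, 0.0]), np.array([1.0, 1.0, 1.0]))
b = (np.array([0.2, 0.2, 0.2]), np.array([0.6, 0.6, 0.6]))
assert len(filter_intersections([a, b])) == 1
```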

Summary

Introduction

Objects are regarded as the main contents of an indoor scene and provide crucial clues for scene understanding and environmental perception. Object detection helps an indoor service robot gain higher semantic awareness of its operating environment. However, 2D detection methods are still not enough for a robot operating in 3D space: they are not robust enough to be used by a robot to perform tasks such as obstacle avoidance, navigation, or essential object grabbing. To resolve this problem, 3D object detection emerges as a candidate to realize object classification and detection together with position and orientation information. Most existing methods utilize a single RGB-D image of an object, as in the NYU dataset, which leaves the object's point cloud incomplete. We therefore explore a multi-view fusion method to resolve the incomplete point cloud information of objects.
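To make the multi-view idea concrete, here is a minimal sketch of how per-view object clouds could be brought into a common world frame using camera poses from visual SLAM, then stacked so that partial observations complete each other. The function names and the 4x4 pose convention (T_wc mapping camera to world coordinates) are our assumptions, not the paper's notation.

```python
import numpy as np

def camera_to_world(points_cam, T_wc):
    """Transform an (N, 3) object cloud from the camera frame into the
    world frame using the 4x4 camera pose T_wc estimated by visual SLAM."""
    homogeneous = np.hstack([points_cam, np.ones((len(points_cam), 1))])
    return (homogeneous @ T_wc.T)[:, :3]

def fuse_views(view_clouds, poses):
    """Accumulate per-view clouds of the same object in one world frame,
    so that partial observations from different viewpoints complete each
    other before the bounding box is fitted."""
    return np.vstack([camera_to_world(c, T) for c, T in zip(view_clouds, poses)])

# Usage: two partial views observed under different (hypothetical) poses
identity = np.eye(4)
shifted = np.eye(4)
shifted[:3, 3] = [0.0, 0.0, 1.0]   # second camera translated 1 m along z
view_a = np.array([[0.1, 0.0, 2.0], [0.2, 0.0, 2.0]])
view_b = np.array([[0.1, 0.0, 1.0], [0.2, 0.0, 1.0]])
fused = fuse_views([view_a, view_b], [identity, shifted])
```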

Related Work
Unsupervised Segmentation of the Object Point Cloud
The First Time to Insert Objects to the Object Database
Object Fusion Criterion and Database Maintenance
Object Database Refinement
Atypical Object Filtering Based on Prior Knowledge
Intersection Object Filtering Based on Volume Ratio
Experimental Evaluation
Object-Level 3D Semantic Segmentation Evaluation
Bounding Box Detection Evaluation
Conclusions
