Abstract

Three-dimensional reconstruction and semantic understanding have attracted extensive attention in recent years. However, current reconstruction techniques mainly target large-scale scenes, such as indoor environments or autonomous driving, and there are few studies on the small-scale, high-precision scene reconstruction needed for manipulator operation, which plays an essential role in decision-making and intelligent control. In this paper, a group of images captured by an eye-in-hand vision system mounted on a robotic manipulator is segmented by combining deep learning with geometric features, and a semantic 3D reconstruction is built with a map stitching method. The results demonstrate that our method effectively improves both the quality of the segmented images and the precision of the semantic 3D reconstruction.

Highlights

  • To achieve autonomous operation, the robot must be able to use visual sensors, such as lasers or cameras, to obtain information about the scene [1,2,3]

  • In an unstructured environment, the type and shape of the objects are unpredictable

  • Because previous 3D reconstructions using an eye-in-hand camera rarely contain semantic information, and most current semantic 3D reconstruction is based on hand-held cameras, we discuss two settings: semantic 3D reconstruction based on an eye-in-hand camera and on a hand-held camera


Summary

Introduction

In an unstructured environment, the type and shape of the objects are unpredictable, so to achieve autonomous operation, the robot must be able to use visual sensors, such as lasers or cameras, to obtain information about the scene [1,2,3]. We establish an integrated 3D object semantic reconstruction framework for eye-in-hand manipulators comprising RGBD image segmentation, camera pose optimization, and map stitching. This enables us to achieve the following: (1) combine deep learning with geometric feature methods to perform semantic segmentation; (2) employ the object point cloud segmentation-based Segment Iterative Closest Point (SICP) method to optimize the camera pose and position; and (3) stitch together a semantic 3D map by data association. Combining deep learning with geometric methods improves the accuracy of image segmentation and the quality of object modeling with an eye-in-hand manipulator.
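
The summary does not reproduce the SICP algorithm itself, but the core idea of segmentation-driven pose refinement can be illustrated with a minimal sketch. The code below is our own simplified reading, not the authors' implementation: nearest-neighbour correspondences are restricted to points that share an object segment label, and one rigid transform is estimated over all matched segments jointly. The function names (segment_icp, best_rigid_transform), the data layout (per-label NumPy arrays), and the convergence test are assumptions made for illustration; only NumPy and SciPy are required.

import numpy as np
from scipy.spatial import cKDTree

def best_rigid_transform(src, dst):
    """Least-squares rigid transform (Kabsch/SVD) mapping src onto dst, both (N, 3)."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:  # guard against a reflection solution
        Vt[-1] *= -1
        R = Vt.T @ U.T
    t = dst_c - R @ src_c
    return R, t

def segment_icp(src_segments, dst_segments, iters=30, tol=1e-6):
    """Segment-wise ICP sketch: correspondences are searched only between
    points carrying the same segment label; a single rigid transform is
    refined over all matched segments jointly."""
    R, t = np.eye(3), np.zeros(3)
    trees = {label: cKDTree(pts) for label, pts in dst_segments.items()}
    prev_err = np.inf
    for _ in range(iters):
        src_all, dst_all = [], []
        for label, pts in src_segments.items():
            if label not in trees:
                continue  # segment seen in only one view: no correspondences
            moved = pts @ R.T + t
            _, idx = trees[label].query(moved)
            src_all.append(moved)
            dst_all.append(dst_segments[label][idx])
        src_all, dst_all = np.vstack(src_all), np.vstack(dst_all)
        dR, dt = best_rigid_transform(src_all, dst_all)
        R, t = dR @ R, dR @ t + dt  # compose the increment with the running pose
        err = np.linalg.norm(src_all @ dR.T + dt - dst_all, axis=1).mean()
        if abs(prev_err - err) < tol:
            break
        prev_err = err
    return R, t

# Toy usage: two labelled point clouds related by a small rigid motion.
rng = np.random.default_rng(0)
dst = {"cup": rng.normal(size=(200, 3)),
       "box": rng.normal(size=(200, 3)) + 5.0}
theta = 0.1  # ICP assumes a reasonable initial alignment, so keep the motion small
Rz = np.array([[np.cos(theta), -np.sin(theta), 0.0],
               [np.sin(theta),  np.cos(theta), 0.0],
               [0.0,            0.0,           1.0]])
src = {label: pts @ Rz.T + np.array([0.05, -0.02, 0.01]) for label, pts in dst.items()}
R_est, t_est = segment_icp(src, dst)
print("recovered rotation:\n", R_est)  # approximately the inverse of Rz

Restricting matches to like-labelled segments is what distinguishes this from plain ICP: points on one object cannot be pulled toward a geometrically similar but semantically different object, which is a plausible motivation for driving the pose optimization from the object point cloud segmentation.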

Related Works
Semantic 3D Reconstruction Based on an Eye-in-Hand Camera
Semantic 3D Reconstruction Based on a Hand-Held Camera
Overview of the Proposed Method
Point Cloud Segmentation Based on the Geometric Feature Method
Fusion Segmentation
Camera Pose Optimization
Data Association and Map Stitching
Experimental Conditions
Results
Three-Dimensional Reconstruction Results
YCB Dataset Results
Discussion and Conclusions
