Abstract
6D target object detection is of great importance to many applications such as robotics, industrial automation, and unmanned vehicles and is increasingly influencing broad industries including manufacturing, transportation, and retail industries, to name a few. This paper focuses on detecting the 6D poses of the target objects from the point cloud of a cluttered scene. However, conventional point cloud-based 6D object detection methods rely on the robustness of key-point detection results that are not straightforward for humans to understand. The drawback makes conventional point cloud-based methods require expert knowledge to tune. In this paper, we introduced a 6D target object detection method that uses segmented object point cloud patches instead of key points to predict object 6D poses and identity. Our main contributions are an end-to-end data-driven pose correction model that is enhanced with a novel simple yet efficient basis spanning layer booster. Experiments show that although the proposed model is trained only using object CAD models, its 6D detection performance matches that of the models using view data. Thus, the proposed method is suitable for 6D detection applications that have object CAD models instead of labeled scene data.
Highlights
The 6D target object detection refers to the process of identifying an object and estimating its 6-DOF pose in the scene of interest
We focus on the problem of 6D target object detection on cluttered scenes where the object is represented by its shape or geometry e.g., a point cloud
A pose correction model (PCM) that takes the point cloud data as the input is proposed to predict the 6D pose and identity of the segmented point cloud patches in an end-to-end fashion
Summary
The 6D target object detection refers to the process of identifying an object and estimating its 6-DOF pose in the scene of interest. The conventional point cloud-based 6D detection methods first detect repeatable key-points, the geometry information around the key-point is encoded into numerical features using feature descriptors [9], [10] In this way, the object can be detected based on the feature matching result. We firstly segment the scene point cloud into patches, and the patches are processed by the proposed pose correction model (PCM) to predict object 6D poses and identify the object in an end-to-end fashion. In this way, the key-point detection procedure is substituted by point cloud segmentation.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.