Abstract

TheSixdegrees of freedom 6DoF pose estimation of texture-less objects provides a spatial understanding of industrial scenes and is the basis for accurate object manipulation. Recent studies have shown that the introduction of known model information and initial poses helps CNNs-based methods achieve better performance in complex scenes. However, the mapping from image space to pose space learned by neural networks is dimensionally lifting. Due to the lack of depth information, it is difficult for neural networks to capture local clues on texture-less surfaces and directly regress the relative 3-D translation and 3-D rotation. Instead, we propose a novel framework named REG-Net, which transforms the 6DoF pose estimation task into a 2-D keypoint registration problem. The proposed network first encodes regional prior information using multi-representation, utilizes the globally-consistent offset attention module to align 2-D keypoint features in a long range, and then estimates offsets and potential regions of keypoints. The proposed regional PnP simultaneously adjusts the keypoint locations in a short range and outputs the pose. This framework compresses the learning space of the network from 3-D to 2-D. Extensive experiments on two benchmark datasets demonstrate the robustness and accuracy of REG-Net. We further demonstrate the effectiveness of REG-Net in the reflective industrial part grasping applications.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.