Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation

Yinlin Hu,Mathieu Salzmann,Pascal Fua

doi:10.1007/978-3-031-20086-1_6

Abstract

AbstractMost recent 6D object pose estimation methods, including unsupervised ones, require many real training images. Unfortunately, for some applications, such as those in space or deep under water, acquiring real images, even unannotated, is virtually impossible. In this paper, we propose a method that can be trained solely on synthetic images, or optionally using a few additional real ones. Given a rough pose estimate obtained from a first network, it uses a second network to predict a dense 2D correspondence field between the image rendered using the rough pose and the real image and infers the required pose correction. This approach is much less sensitive to the domain shift between synthetic and real images than state-of-the-art methods. It performs on par with methods that require annotated real images for training when not using any, and outperforms them considerably when using as few as twenty real images.Keywords6D object pose estimation6D object pose refinementImage synthesisDense 2D correspondenceDomain adaptation

Full Text