Unsupervised Learning of Depth and Ego-Motion with Spatial-Temporal Geometric Constraints

Anjie Wang,Jenq-Neng Hwang,Yongbin Gao,Xiaoyan Jiang,Siwei Ma,Shanshe Wang,Zhijun Fang

doi:10.1109/icme.2019.00309

Abstract

In this paper, we propose an unsupervised joint deep learning pipeline for depth and ego-motion estimation that explicitly incorporated with traditional spatial-temporal geometric constraints. The stereo reconstruction error provides the spatial geometric constraint to estimate the absolute scale depth. Meanwhile, the depth map with absolute scale and a pre-trained pose network serve as a good starting point for direct visual odometry (DVO), resulting in a fine-grained ego-motion estimation with the additional back-propagation signals provided to the depth estimation network. The proposed joint training pipeline enables an iterative coupling optimization process for accurate depth and precise ego-motion estimation. The experimental results show the state-of-the-art performance for monocular depth and ego-motion estimation on the KITTI dataset and a great generalization ability of the proposed approach.

Full Text