DVONet: Unsupervised Monocular Depth Estimation and Visual Odometry

Xiangyu Li,Wanqing Li,Yonghong Hou,Qi Wu,Pichao Wang

doi:10.1109/vcip47243.2019.8965952

DVONet: Unsupervised Monocular Depth Estimation and Visual Odometry

Xiangyu Li, Wanqing Li + Show 3 more

https://doi.org/10.1109/vcip47243.2019.8965952

Copy DOI

Publication Date: Dec 1, 2019

Citations: 20

Affiliation: Tianjin University, University of Wollongong, Alibaba Group (United States), Bellevue Hospital Center

#Visual Odometry #Stereo Image Sequences + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper proposes an unsupervised learning framework for monocular depth estimation and visual odometry (VO), referred to as DVONet. The framework is trained using stereo image sequences and is able to estimate absolute-scale scene depth and camera poses from monocular images. To mitigate the effect of stereo occlusions in training and improve the depth estimation, left-right occlusion mask is introduced. In addition, a novel VO network is proposed where the feature extraction network is shared between pose estimation and optical flow estimation. The proposed DVONet achieves state-of-the-art results for both depth estimation and VO tasks on the KITTI driving dataset, outperforming the existing unsupervised methods and being comparable to the traditional ones.

Full Text