Unsupervised framework for depth estimation and camera motion prediction from video

Delong Yang,Xunyu Zhong,Dongbing Gu,Xiafu Peng,Huosheng Hu

doi:10.1016/j.neucom.2019.12.049

Abstract

Depth estimation from monocular video plays a crucial role in scene perception. The significant drawback of supervised learning models is the need for vast amounts of manually labeled data (ground truth) for training. To overcome this limitation, unsupervised learning strategies without the requirement for ground truth have achieved extensive attention from researchers in the past few years. This paper presents a novel unsupervised framework for estimating single-view depth and predicting camera motion jointly. Stereo image sequences are used to train the model while monocular images are required for inference. The presented framework is composed of two CNNs (depth CNN and pose CNN) which are trained concurrently and tested independently. The objective function is constructed on the basis of the epipolar geometry constraints between stereo image sequences. To improve the accuracy of the model, a left-right consistency loss is added to the objective function. The use of stereo image sequences enables us to utilize both spatial information between stereo images and temporal photometric warp error from image sequences. Experimental results on the KITTI and Cityscapes datasets show that our model not only outperforms prior unsupervised approaches but also achieving better results comparable with several supervised methods. Moreover, we also train our model on the Euroc dataset which is captured in an indoor environment. Experiments in indoor and outdoor scenes are conducted to test the generalization capability of the model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unsupervised framework for depth estimation and camera motion prediction from video

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Dec 19, 2019
Citations: 15

Similar Papers

Estimating 3D vehicle motion in an outdoor scene from monocular and stereo image sequences
M.K Leung ... T.S Huang
-
M.K Leung, et. al.M.K Leung ... T.S Huang
07 Oct 1991
07 Oct 1991

DVONet: Unsupervised Monocular Depth Estimation and Visual Odometry
Xiangyu Li ... Pichao Wang
-
Xiangyu Li, et. al.Xiangyu Li ... Pichao Wang
01 Dec 2019
01 Dec 2019

Estimation of depth and 3D motion parameters of moving objects with multiple stereo images by using Kalman filter
Jae-Woong Yi ... Jun-Ho Oh
-
Jae-Woong Yi, et. al. Jae-Woong Yi ... Jun-Ho Oh
06 Nov 1995
06 Nov 1995

Estimating three-dimensional vehicle motion in an outdoor scene using stereo image sequences
Mun K Leung ... Thomas S Huang
International Journal of Imaging Systems and Technology | VOL. 4
Mun K Leung, et. al.Mun K Leung ... Thomas S Huang
01 Jan 1992
International Journal of Imaging Systems and Technology | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unsupervised framework for depth estimation and camera motion prediction from video

Abstract

Talk to us

Similar Papers

More From: Neurocomputing