Abstract
Considerable advances have been achieved in estimating the depth map from a single image via convolutional neural networks (CNNs) during the past few years. Combining depth prediction from CNNs with conventional monocular simultaneous localization and mapping (SLAM) is promising for accurate and dense monocular reconstruction, in particular for addressing two long-standing challenges in conventional monocular SLAM: low map completeness and scale ambiguity. However, depth estimated by pretrained CNNs usually fails to achieve sufficient accuracy in environments that differ from the training data, a situation common in applications such as drone obstacle avoidance in unknown scenes. Additionally, inaccurate depth predictions from the CNN can yield large tracking errors in monocular SLAM. In this paper, we present a real-time dense monocular SLAM system that effectively fuses direct monocular SLAM with an online-adapted depth prediction network, achieving accurate depth prediction for scenes that differ from the training data and providing absolute scale information for tracking and mapping. Specifically, on one hand, the tracking pose (i.e., translation and rotation) from direct SLAM is used to select a small set of highly effective and reliable training images, which act as ground truth for tuning the depth prediction network on the fly, improving its generalization to scenes of different types. A stage-wise Stochastic Gradient Descent algorithm with a selective update strategy is introduced for efficient convergence of the tuning process. On the other hand, the dense map produced by the adapted network is used to resolve the scale ambiguity of direct monocular SLAM, which in turn improves the accuracy of both tracking and the overall reconstruction. The system, running on both CPUs and GPUs, achieves real-time performance with progressively improved reconstruction accuracy. Experimental results on public datasets and a live application to drone obstacle avoidance demonstrate that our method outperforms state-of-the-art methods with greater map completeness and accuracy, and smaller tracking errors.
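For illustration only: a minimal sketch (not from the paper) of how a CNN-predicted dense depth map could supply absolute scale to an up-to-scale semi-dense depth map from direct monocular SLAM, here via a robust median of per-pixel depth ratios. All function and variable names are hypothetical assumptions, not the authors' implementation.

```python
import numpy as np

def estimate_absolute_scale(cnn_depth, slam_depth, min_depth=0.1):
    """Estimate a single scale factor aligning an up-to-scale semi-dense
    SLAM depth map to a CNN-predicted metric depth map (hypothetical sketch).

    cnn_depth  : HxW array, metric depth predicted by the adapted network.
    slam_depth : HxW array, semi-dense depth from direct SLAM; invalid
                 pixels are marked with 0 or NaN.
    """
    valid = np.isfinite(slam_depth) & (slam_depth > min_depth) & (cnn_depth > min_depth)
    if valid.sum() < 100:  # too few correspondences to estimate scale reliably
        return None
    # Median of per-pixel ratios is robust to outliers in either depth source.
    return float(np.median(cnn_depth[valid] / slam_depth[valid]))

# Usage sketch: rescale the SLAM map and the tracked translation to metric units.
# scale = estimate_absolute_scale(cnn_depth, slam_depth)
# if scale is not None:
#     slam_depth_metric = slam_depth * scale
#     translation_metric = translation * scale
```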