SD-VIS: A Fast and Accurate Semi-Direct Monocular Visual-Inertial Simultaneous Localization and Mapping (SLAM).

Quanpan Liu, Huan Wang, Zhengjie Wang

doi:10.3390/s20051511

Abstract

In practical applications, achieving a good balance between high accuracy and computational efficiency is a main challenge for simultaneous localization and mapping (SLAM). To address this challenge, we propose SD-VIS, a novel fast and accurate semi-direct visual-inertial SLAM framework that estimates camera motion and the sparse structure of the surrounding scene. In the initialization procedure, we align the pre-integrated IMU measurements with the visual images and calibrate the metric scale, initial velocity, gravity vector, and gyroscope bias using multiple view geometry (MVG) theory based on the feature-based method. At the front-end, keyframes are tracked by the feature-based method and used for back-end optimization and loop closure detection, while non-keyframes are used for fast tracking by the direct method. This strategy gives the system the better real-time performance of the direct method together with the high accuracy and loop closure detection ability of the feature-based method. At the back-end, we propose a sliding window-based tightly-coupled optimization framework, which obtains more accurate state estimates by minimizing the visual and IMU measurement errors. To limit the computational complexity, we adopt a marginalization strategy that fixes the number of keyframes in the sliding window. Experimental evaluation on the EuRoC dataset demonstrates the feasibility and superior real-time performance of SD-VIS. Compared with state-of-the-art SLAM systems, SD-VIS achieves a better balance between accuracy and speed.

Highlights

Simultaneous localization and mapping (SLAM) plays an important role in self-driving cars, virtual reality, unmanned aerial vehicles (UAV), augmented reality and artificial intelligence [1,2]. This technology can provide reliable state estimation for UAVs and self-driving cars in GPS-denied environments by relying on its sensors.
Various types of sensors can be utilized in SLAM, such as stereo cameras, lidar, inertial measurement units (IMU), and monocular cameras.
Each has significant disadvantages when used individually: the metric scale of a stereo camera can be obtained directly from its fixed baseline length, but it can be estimated accurately only within a limited depth range [3]; lidar has high precision indoors, but it encounters reflection problems on glass surfaces outdoors [4]; cheap IMUs are extremely susceptible to bias and noise [5]; a monocular camera cannot estimate the absolute metric scale [6].

Summary

Introduction

Simultaneous localization and mapping (SLAM) plays an important role in self-driving cars, virtual reality, unmanned aerial vehicles (UAV), augmented reality and artificial intelligence [1,2]. The direct method considers the entire image or some pixels with a large gradient and directly estimates the camera motion and scene structure by minimizing the photometric error [11,12,13,14]. In [25,26], different semi-direct approaches were proposed for stereo odometry. Both methods use feature-based tracking to obtain a motion prior, and perform direct semi-dense or sparse alignment to refine the camera pose. At the front-end, we only need to extract new feature points on the keyframes and track them with the KLT sparse optical flow algorithm, which can further reduce the calculation while ensuring accuracy. Keyframes in SD-VIS are tracked by the feature-based method and are used for sliding window non-linear optimization and loop closure detection.
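To make the photometric error used by direct methods concrete, the following is a minimal sketch and not the SD-VIS implementation. It assumes a pinhole camera with intrinsics `(fx, fy, cx, cy)`, known depths for the selected reference pixels, and a relative pose `(R, t)` from the reference frame to the current frame. All function names are hypothetical and chosen for illustration only.

```python
# Illustrative sketch only: names, pinhole model, and known depths are assumptions.

def back_project(u, v, depth, fx, fy, cx, cy):
    """Lift pixel (u, v) with a given depth to a 3D point in the camera frame."""
    return ((u - cx) / fx * depth, (v - cy) / fy * depth, depth)

def transform(R, t, p):
    """Apply the rigid-body motion p' = R p + t (R is a 3x3 nested list)."""
    return tuple(sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))

def project(p, fx, fy, cx, cy):
    """Project a 3D point onto the image plane of a pinhole camera."""
    return fx * p[0] / p[2] + cx, fy * p[1] / p[2] + cy

def bilinear(img, x, y):
    """Sample a grayscale image (list of rows) at sub-pixel location (x, y)."""
    x0, y0 = int(x), int(y)
    ax, ay = x - x0, y - y0
    return ((1 - ax) * (1 - ay) * img[y0][x0]
            + ax * (1 - ay) * img[y0][x0 + 1]
            + (1 - ax) * ay * img[y0 + 1][x0]
            + ax * ay * img[y0 + 1][x0 + 1])

def photometric_error(ref_img, cur_img, pixels, depths, R, t, intrinsics):
    """Sum of squared intensity differences between reference pixels and
    their warped locations in the current image."""
    height, width = len(cur_img), len(cur_img[0])
    total = 0.0
    for (u, v), depth in zip(pixels, depths):
        p_cur = transform(R, t, back_project(u, v, depth, *intrinsics))
        if p_cur[2] <= 0:
            continue  # point is behind the current camera
        x, y = project(p_cur, *intrinsics)
        if not (0 <= x < width - 1 and 0 <= y < height - 1):
            continue  # warped pixel falls outside the image
        residual = bilinear(cur_img, x, y) - ref_img[v][u]
        total += residual * residual
    return total

# Usage on a synthetic intensity ramp I(x, y) = 2x + 3y.
image = [[2.0 * x + 3.0 * y for x in range(8)] for y in range(8)]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
intrinsics = (10.0, 10.0, 4.0, 4.0)
pixels, depths = [(2, 2), (3, 4), (4, 3)], [1.0, 1.0, 1.0]
print(photometric_error(image, image, pixels, depths, identity, (0.0, 0.0, 0.0), intrinsics))
print(photometric_error(image, image, pixels, depths, identity, (0.1, 0.0, 0.0), intrinsics))
```

A direct tracker would minimize this cost over the pose, typically with Gauss-Newton or Levenberg-Marquardt on image gradients; the sketch only evaluates the cost for a given pose. In the usage example the error is near zero at the true (identity) pose and grows once the pose is perturbed.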

System Framework Overview

Definition of Symbols

IMU Pre-Integration

Visual-Inertial Alignment

Gyroscope Bias Correction

Gravity Vector Refinement

Keyframe Selection

Keyframes Tracking

Non-Keyframes

Adjusting

Adjust

Sliding Window-based Tightly-coupled Optimization Framework

Formulation

Visual Re-Projection Errors

Marginalization Strategy

Re-Localization

Marginalization strategy

Accuracy and Robustness Evaluation

Real-Time Performance Evaluation

Loop Closure Detection Evaluation

Findings

Conclusions

Journal: Sensors	Publication Date: Mar 9, 2020
Citations: 8	License type: CC BY 4.0
