Abstract

Existing visual simultaneous localization and mapping (V-SLAM) algorithms are usually sensitive to environments with sparse landmarks and to large view transformations during camera motion; as the feature-point matching rate drops, they tend to accumulate large pose errors that lead to tracking failures. To address these problems, this article proposes an improved V-SLAM method based on scene segmentation and an incremental optimization strategy. In the front end, this article proposes a scene segmentation algorithm that considers the camera motion direction and angle. By segmenting the trajectory and adding the camera motion direction to the tracking thread, an effective prediction model of camera motion is obtained in scenes with sparse landmarks and large view transformations. In the back end, this article proposes an incremental optimization method that combines the segmentation information with an optimization method for the tracking prediction model. By incrementally adding state parameters and reusing previously computed results, high-precision estimates of the camera trajectory and feature points are obtained at a satisfactory computing speed. The performance of our algorithm is evaluated on two well-known datasets, TUM RGB-D and NYUDv2 RGB-D. The experimental results demonstrate that our method improves computational efficiency over state-of-the-art V-SLAM systems by 10.2% on the desktop platform and by 22.4% on the embedded platform. Meanwhile, the robustness of our method is better than that of ORB-SLAM2 on the TUM RGB-D dataset.
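As a rough illustration of the incremental back-end strategy described above (a minimal sketch of the general pattern, not the authors' implementation; all names below are ours), the following Python snippet shows how a least-squares system can grow by appending state blocks and folding in new measurements while reusing every previously accumulated block instead of rebuilding the problem from scratch:

```python
import numpy as np

class IncrementalSolver:
    """Toy incremental least-squares backend: the information matrix H and
    vector b are kept between updates, so adding states or measurements
    reuses all previously computed blocks instead of rebuilding them."""

    def __init__(self):
        self.H = np.zeros((0, 0))  # accumulated information matrix
        self.b = np.zeros(0)       # accumulated information vector

    def add_states(self, n_new):
        # Grow the system by n_new variables; prior blocks are copied, not recomputed.
        n = self.H.shape[0]
        H = np.zeros((n + n_new, n + n_new))
        b = np.zeros(n + n_new)
        H[:n, :n] = self.H
        b[:n] = self.b
        self.H, self.b = H, b

    def add_measurement(self, idx, J, r):
        # Fold one linearized measurement (residual r with Jacobian J over the
        # states listed in idx) into the existing system.
        idx = np.asarray(idx)
        self.H[np.ix_(idx, idx)] += J.T @ J
        self.b[idx] += J.T @ r

    def solve(self):
        # Small damping keeps the solve well-posed before every state is constrained.
        n = len(self.b)
        return np.linalg.solve(self.H + 1e-9 * np.eye(n), self.b)
```

The key property this sketch demonstrates is that the cost of an update scales with the size of the new measurement blocks rather than with the full state dimension, which is what makes incremental optimization attractive on embedded platforms.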

Highlights

  • In recent years, visual simultaneous localization and mapping (V-SLAM) technology has developed significantly

  • This article proposes an improved V-SLAM method based on scene segmentation and incremental optimization

  • The experimental results demonstrate that our method improves computational efficiency over state-of-the-art V-SLAM systems by 10.2% on the desktop platform and by 22.4% on the embedded platform


Summary

Introduction

In recent years, visual simultaneous localization and mapping (V-SLAM) technology has developed significantly, yet existing systems remain sensitive to sparse landmarks and large view transformations. To address these problems, this article proposes an improved V-SLAM method based on scene segmentation and incremental optimization. Unlike mainstream V-SLAM systems that use all feature-point information in the keyframe, this article selects the landmarks near the current line-of-sight vector, matched against the last motion direction of the camera, to improve positioning accuracy. In the front-end module, by adding scene segmentation and the camera motion direction to the tracking thread and segmenting the camera trajectory, effective prediction of camera motion is achieved in road segments with sparse landmarks and large view transformations. The robustness of our method is better than that of ORB-SLAM2 [3,4] on the TUM RGB-D dataset.[1]
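To make the landmark selection idea concrete, here is a minimal sketch (illustrative only; the function name, cone threshold, and data layout are our assumptions, not the paper's code) that keeps only landmarks whose bearing from the camera lies within a cone around the last camera motion direction:

```python
import numpy as np

def select_landmarks(points, cam_center, motion_dir, max_angle_deg=30.0):
    """Keep landmarks whose bearing from the camera is within max_angle_deg
    of the last camera motion direction.

    points: (N, 3) landmark positions in world coordinates
    cam_center: (3,) camera position
    motion_dir: (3,) last camera motion direction (any length)
    """
    d = motion_dir / np.linalg.norm(motion_dir)
    rays = points - cam_center                          # rays camera -> landmark
    rays = rays / np.linalg.norm(rays, axis=1, keepdims=True)
    cos_offset = rays @ d                               # cosine of angular offset
    return points[cos_offset >= np.cos(np.radians(max_angle_deg))]
```

Restricting matching to this cone shrinks the candidate set when the view changes sharply, which is consistent with the article's aim of keeping the matching rate usable under large view transformations.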
