Abstract

Identifying dynamic objects in dynamic scenes remains a challenge for traditional simultaneous localization and mapping (SLAM) algorithms, and these algorithms cannot adequately inpaint the regions left vacant after dynamic objects are culled. To address these limitations, this study proposes VTD-SLAM, a novel visual SLAM (vSLAM) algorithm based on improved Vision Transformer semantic segmentation in dynamic scenes. Specifically, VTD-SLAM uses a residual dual-pyramid backbone network to extract dynamic-object region features, and a multiclass feature transformer segmentation module to increase the pixel weight of potential dynamic objects and enrich global semantic information, enabling precise identification of potential dynamic objects. Multi-view geometry is then applied to verify and remove the dynamic objects. Meanwhile, drawing on static information in adjacent frames, an optimal nearest-neighbor pixel-matching method restores the static background, from which feature points are extracted for pose estimation. Validated on the public TUM dataset (Technical University of Munich) and in real scenes, the algorithm reduces root-mean-square error by 17.1% compared with dynamic SLAM (DynaSLAM) and shows better map-building capability.
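
The multi-view geometry check described above can be illustrated with a minimal sketch (not the authors' implementation): a point back-projected from a reference frame is transformed by the estimated relative pose and compared against the depth actually measured in the current frame; a large disagreement marks the point as dynamic. The pinhole intrinsics `K`, relative pose `(R, t)`, and relative-depth threshold `tau` are illustrative assumptions.

```python
import numpy as np

def is_dynamic(u_ref, v_ref, z_ref, depth_cur, K, R, t, tau=0.05):
    """Flag a pixel as dynamic via a reprojected-depth consistency test.

    u_ref, v_ref, z_ref: pixel coordinates and depth in the reference frame.
    depth_cur: HxW depth map of the current frame.
    K: 3x3 camera intrinsics; (R, t): relative pose reference -> current.
    """
    # Back-project the reference pixel to a 3D point in the reference camera.
    p_ref = z_ref * (np.linalg.inv(K) @ np.array([u_ref, v_ref, 1.0]))
    # Transform into the current camera and project to pixel coordinates.
    p_cur = R @ p_ref + t
    u, v, w = K @ p_cur
    u, v = u / w, v / w
    if not (0 <= int(v) < depth_cur.shape[0] and 0 <= int(u) < depth_cur.shape[1]):
        return False  # projects out of view: cannot decide, keep as static
    z_expected = p_cur[2]
    z_measured = depth_cur[int(v), int(u)]
    # A static point satisfies z_measured ~= z_expected; a large relative gap
    # means the observed surface moved with respect to the prediction.
    return abs(z_measured - z_expected) > tau * z_expected
```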
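
The background-restoration step can likewise be sketched under simplifying assumptions. Here the adjacent frame is assumed to have already been warped into the current viewpoint (`warped_adjacent` and its validity mask `warped_valid` are hypothetical precomputed inputs), and the paper's optimal nearest-neighbor pixel matching is reduced to a Euclidean nearest-pixel fallback via a distance transform; this is a simplified stand-in, not the authors' method.

```python
import numpy as np
from scipy.ndimage import distance_transform_edt

def inpaint_static_background(frame, dyn_mask, warped_adjacent, warped_valid):
    """Fill culled dynamic regions with static background.

    frame: HxWx3 current image; dyn_mask: HxW bool, True on dynamic pixels.
    warped_adjacent: adjacent frame reprojected into the current view.
    warped_valid: HxW bool, True where the warp produced a valid pixel.
    """
    out = frame.copy()
    # 1) Prefer true background observed in the warped adjacent frame.
    fill = dyn_mask & warped_valid
    out[fill] = warped_adjacent[fill]
    # 2) For pixels still missing, copy the nearest already-valid pixel
    #    (nearest-neighbor matching reduced to Euclidean nearest pixel).
    missing = dyn_mask & ~warped_valid
    if missing.any():
        _, (iy, ix) = distance_transform_edt(missing, return_indices=True)
        out[missing] = out[iy[missing], ix[missing]]
    return out
```

Feature points for pose estimation are then extracted from the restored image, so the culled regions no longer starve the tracker of correspondences.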
