A novel monocular visual simultaneous localization and mapping (SLAM) algorithm built on the semi-direct method is proposed to deal with some problems in complex environments, such as low-texture, moving objects and perceptual aliasing. The proposed algorithm takes advantage of direct and feature-based methods. On one hand, a direct method is used to track the camera poses and solve the feature alignment. On the other hand, ORB features in keyframes are extracted and matched for optimization and loop closure. To improve the localization accuracy in dynamic environments, a motion detection module that is robust to illumination change is adopted. In addition, for the sake of resolving the loop closure detection problem in perceptual aliasing scenes, this paper fuses the spatial information between two visual words into the bag of visual words (BoVW) model and employs an improved pyramid term frequency-inverse document frequency (TF-IDF) scoring match scheme. Experimental results prove that the proposed algorithm behaves better performance than ORB-SLAM with regard to overall accuracy and speed in complex environments.
Read full abstract