Abstract
In practical applications, how to use the complementary strengths of the direct and the feature-based methods for effective fusion may be the main challenge of simultaneous localization and mapping (SLAM). To solve this challenge, we propose the DO-SLAM, a novel fast and accurate semi-direct visual SLAM framework, which can maintain the direct method’s fast performance and the high precision and loop closure capability of the feature-based method. The direct method is used as the first half of the DO-SLAM to track the camera pose rapidly and robustly. The feature-based method is used as the second half of the DO-SLAM to refine the keyframe poses, perform loop closures, and build a globally consistent, long-term, sparse feature map that can be reused. The proposed pipeline fuses direct odometry and feature-based SLAM to perform three levels of parallel optimizations: (1) In the direct method module, the keyframe poses are estimated by minimizing the photometric error, (2) In the feature-based module, using the poses calculated by the inter-frame matching to correct and fuse the poses calculated by the direct method module as the initial poses, and the initial poses are optimized by the motion-only bundle adjustment, and (3) A pose graph optimization is used to achieve global map consistency in the presence of loop closures. Experimental evaluation on two benchmark datasets demonstrates that the proposed approach achieves higher accuracy and robustness on motion estimation compared to the other state-of-the-art methods.
Highlights
Simultaneous localization and mapping (SLAM) plays an essential role in self-driving cars, virtual and augmented reality, unmanned aerial vehicles (UAV), artificial intelligence [1], [2]
In order to effectively combine the advantages of the direct method and the feature-based method to achieve a more accurate estimation of the camera poses, a novel semi-direct approach is proposed in this study to maintain the fast performance of a direct method and the high precision and loop closure capability of a feature-based method
We present DO-SLAM, a novel fast and accurate semidirect visual SLAM framework, that combines the exactness of the feature-based method and the quickness of the direct method
Summary
Simultaneous localization and mapping (SLAM) plays an essential role in self-driving cars, virtual and augmented reality, unmanned aerial vehicles (UAV), artificial intelligence [1], [2]. In order to effectively combine the advantages of the direct method and the feature-based method to achieve a more accurate estimation of the camera poses, a novel semi-direct approach is proposed in this study to maintain the fast performance of a direct method and the high precision and loop closure capability of a feature-based method. This approach uses the DSO [6] as the first half to track the camera poses rapidly and robustly and uses the nonlinear optimization based on sliding window to solve the keyframe poses and the coordinates of map-points.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.