Abstract

The estimation of a fundamental matrix (F-matrix) from two-view images is a crucial problem in epipolar geometry, and a key point in visual simultaneous localization and mapping (VSLAM). Conventional robust methods proposed by the data calculation space, such as Random Sample Consensus (RANSAC), encounter computational inefficiency and low accuracy when the outliers exceed 50%. In this paper, a semantic filter-based on faster region-based convolutional neural network (faster R-CNN) is proposed to solve the outlier problem in RANSAC based F-matrix calculations. The semantic filter is trained using semantic patches tailored by inliers, providing different semantic labels in various image regions. First, the patches classified into the top three bad labels are filtered out during the pre-processing phase. Second, precise and robust correspondences are determined by the remaining high-level semantic contexts. Finally, the inliers are assessed using RANSAC to produce an accurate F-matrix. The proposed algorithm can improve the accuracy of F-matrix calculations, as low-quality feature correspondences are effectively decreased. Experiments on KITTI and ETH sequences illustrate that the 3D position error can be reduced by applying the semantic filter to the ORB-SLAM system. Further, indoor and real environment experiments demonstrate that an effective lower trajectory error is yielded with the proposed approach.

Highlights

  • The calculation of a fundamental matrix (F-matrix) from twoview images is a crucial problem in epipolar geometry, and a key component of visual simultaneous location and mapping (VSLAM) [1]

  • It should be noted that the main hardware for suitable implementation is the graphics processing unit (GPU), and its memory is at least 3GB, as suggested

  • As Random Sample Consensus (RANSAC) is the most popular tool for obtaining F-matrix estimations, and Mask R-CNN outperforms all existing deep learning-based entries on many tasks, we analyze the results provided by our proposed method, the standard RANSAC and our method is pipelined using Mask R-CNN

Read more

Summary

Introduction

The calculation of a fundamental matrix (F-matrix) from twoview images is a crucial problem in epipolar geometry, and a key component of visual simultaneous location and mapping (VSLAM) [1]. C. Shao et al.: Deep Learning-Based Semantic Filter for RANSAC-Based F-Matrix Calculation and the ORB-SLAM System image space usually have 60%∼80% ratio in all matches.

Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.