Abstract
An image can be described in terms of appearance frequency of visual words. This representation is implemented in bag-of-visual-words (BoVW)-based loop closure detection for its efficiency and effectiveness. However, traditional BoVW-based approaches are strongly affected by false positive loops due to scene ambiguity caused by redundant words in the vocabulary and fail to detect bidirectional loops in monocular mode. Aiming at overcoming these problems, we propose a novel vocabulary construction algorithm named hierarchical sequential information bottleneck (HsIB) by leveraging the maximization of mutual information (MMI) mechanism. First, feature descriptors are extracted from training images for visual vocabulary construction. Second, HsIB extracts discriminative yet informative visual words through the MMI mechanism in vocabulary construction, which treats feature descriptors clustering as a process of data compression. Finally, the clustering process reaches a tradeoff between compactness and discrimination and improves the performance of traditional BoVW-based loop closure detection. The proposed method is compared with state-of-the-art methods on publicly available datasets. We also create a challenging dataset to further evaluate the performance of HsIB on bidirectional loops. To the best of our knowledge, we are the first to implement information bottleneck (IB) method in visual-SLAM (vSLAM) loop closure detection, and we obtain impressive results.
Highlights
Recognizing a place that has already been visited is referred to loop closure detection in simultaneous localization and mapping (SLAM) [1]
To the best of our knowledge, this is the first time that mutual information (MMI) mechanism has been implemented in the task of loop closure detection; 2) We propose a novel vocabulary construction algorithm hierarchical sequential information bottleneck (HsIB), which aims to find a balance between
In this paper, we present an algorithm for constructing a vocabulary of visual words by utilizing MMI mechanism, which is named HsIB
Summary
Recognizing a place that has already been visited is referred to loop closure detection in simultaneous localization and mapping (SLAM) [1]. DBoW2 [6] is a state-of-the-art BoVW-based approach for loop closure detection, which constructs a visual vocabulary offline It is an open source C++ repository which is used to extract features from training images and convert them to word appearance frequencies. MMI LOOP CLOSURE DETECTION In BoVW-based loop closure detection, an image is characterized by the appearance frequency of the visual words in the visual vocabulary This process is performed by mapping the local feature descriptors to visual vocabulary, which is constructed through clustering feature descriptors. X i+1(k ∗ (node − 1) + j) = tj; generate V according to the cluster centers of all the nodes in level d for kd visual words; return V ; The input of Algorithm 1 includes the ORB feature descriptors of the training image set and the vocabulary tree parameters. A tree with depth of 3 and 10 branches, thereby resulting in 1.4 MB of leaf nodes, uses 200 MB of memory
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have