Abstract

This paper presents a novel indoor topological localization method based on mobile phone videos. Conventional methods suffer from indoor dynamic environmental changes and scene ambiguity. The proposed Visual Landmark Sequence-based Indoor Localization (VLSIL) method is capable of addressing problems by taking steady indoor objects as landmarks. Unlike many feature or appearance matching-based localization methods, our method utilizes highly abstracted landmark sematic information to represent locations and thus is invariant to illumination changes, temporal variations, and occlusions. We match consistently detected landmarks against the topological map based on the occurrence order in the videos. The proposed approach contains two components: a convolutional neural network (CNN)-based landmark detector and a topological matching algorithm. The proposed detector is capable of reliably and accurately detecting landmarks. The other part is the matching algorithm built on the second order hidden Markov model and it can successfully handle the environmental ambiguity by fusing sematic and connectivity information of landmarks. To evaluate the method, we conduct extensive experiments on the real world dataset collected in two indoor environments, and the results show that our deep neural network-based indoor landmark detector accurately detects all landmarks and is expected to be utilized in similar environments without retraining and that VLSIL can effectively localize indoor landmarks.

Highlights

  • Topological localization is a fundamental component for pedestrians and robots localization, navigation, and mobile mapping [1,2]

  • We propose a novel visual landmark sequence-based approach that exploits the steady objects for indoor topological localization

  • We present a robust landmark detector using a convolutional neural network (CNN) for landmark detection that does not need to retrain for new environments

Read more

Summary

Introduction

Topological localization is a fundamental component for pedestrians and robots localization, navigation, and mobile mapping [1,2]. Represented by a graph, a topological map is a compact and memory-saving way to represent an environment, and is suitable for large-scale scene localization [3]. Each node of it indicates a region of the environment, which is associated with. The vital problem of the technique is to design robust and distinctive features to represent nodes identically. Many handcrafted features have been devised based on colors, gradients [3], lines [4], or distinctive points to represent the nodes. Most of them fail in dynamic indoor environments due to camera noise, illumination and perspective changes, or temporal variations

Objectives
Methods
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call