Online Spatial and Temporal Calibration for Monocular Direct Visual-Inertial Odometry.

Zheyu Feng,Lundong Zhang,Jianwen Li,Chen Chen

doi:10.3390/s19102273

Zheyu Feng, Lundong Zhang + Show 2 more

Open Access

https://doi.org/10.3390/s19102273

Copy DOI

Abstract

Owing to the nonlinearity in visual-inertial state estimation, sufficiently accurate initial states, especially the spatial and temporal parameters between IMU (Inertial Measurement Unit) and camera, should be provided to avoid divergence. Moreover, these parameters are required to be calibrated online since they are likely to vary once the mechanical configuration slightly changes. Recently, direct approaches have gained popularity for their better performance than feature-based approaches in little-texture or low-illumination environments, taking advantage of tracking pixels directly. Based on these considerations, we perform a direct version of monocular VIO (Visual-inertial Odometry), and propose a novel approach to initialize the spatial-temporal parameters and estimate them with all other variables of interest (IMU pose, point inverse depth, etc.). We highlight that our approach is able to perform robust and accurate initialization and online calibration for the spatial and temporal parameters without utilizing any prior information, and also achieves high-precision estimates even when large temporal offset occurs. The performance of the proposed approach was verified through the public UAV (Unmanned Aerial Vehicle) dataset.

Highlights

The monocular visual-inertial system, which is usually composed of a low-cost MEMS (Micro-electromechanical Systems) IMU and a camera, has turned out to be a highly attractive solution for motion tracking and 3D reconstruction due to its lightweight characteristics of size, weight and power
DSO (Direct Sparse Odometry), which came from Engel [13], showed remarkable performance in weak intensity variation environments
The system starts with direct visual odometry

Summary

Introduction

The monocular visual-inertial system, which is usually composed of a low-cost MEMS (Micro-electromechanical Systems) IMU and a camera, has turned out to be a highly attractive solution for motion tracking and 3D reconstruction due to its lightweight characteristics of size, weight and power. With the maturity of feature tracking/matching techniques, feature-based approach has become a convention in visual-inertial algorithms. Most of these algorithms process image by tracking/matching sparse features, and minimize the reprojection error in the estimator [1,2,3,4,5,6,7,8,9,10]. DSO (Direct Sparse Odometry), which came from Engel [13], showed remarkable performance in weak intensity variation environments

Methods

Results

Conclusion