Abstract
Visual-Inertial Odometry (VIO) has been one of the most popular and affordable navigation systems for indoor and even outdoor applications. VIO can augment or replace Global Navigation Satellite Systems (GNSSs) under signal degradation or service interruptions. Conventionally, the fusion of visual and inertial modalities has been performed using optimization-based or filtering-based techniques such as nonlinear Least Squares (LS) or the Extended Kalman Filter (EKF). These classic techniques, despite several simplifying approximations, involve sophisticated modelling and parameterization of the navigation problem, which necessitates expert fine-tuning of the navigation system. In this work, a calibration-free visual-inertial fusion technique using a Deep Convolutional Recurrent Neural Network (DCRNN) is proposed. The network employs a Convolutional Neural Network (CNN) to process the spatial information embedded in the visual data and two Recurrent Neural Networks (RNNs) to process the inertial sensor measurements and the CNN output for final pose estimation. The network is trained with raw Inertial Measurement Unit (IMU) data and monocular camera frames as its inputs, and the relative pose as its output. Unlike conventional VIO techniques, the IMU biases and scale factors and the camera's intrinsic and extrinsic parameters do not need to be explicitly provided or modelled in the proposed navigation system; rather, these parameters, along with the system dynamics, are implicitly learned during the training phase. Moreover, since the inertial and visual data are fused at the mid-layers of the network, deeper correlations between the two modalities are learned than with a simple combination of the final pose estimates of both modalities at the output layers; hence, the fusion can be considered a tightly-coupled fusion of the visual and inertial modalities. The proposed VIO network is evaluated on real datasets, and a thorough discussion is provided on the capabilities of the deep learning approach to VIO.
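To illustrate the mid-layer fusion described above, the following is a minimal, hedged sketch of such an architecture, assuming PyTorch. The layer sizes, the simple CNN backbone, the stacking of consecutive frame pairs, and the number of IMU samples per frame interval are illustrative assumptions, not values taken from the paper.

```python
import torch
import torch.nn as nn


class VIOFusionNet(nn.Module):
    """Sketch of a CNN + dual-RNN visual-inertial fusion network (assumed sizes)."""

    def __init__(self, imu_dim=6, visual_feat_dim=512, hidden=256):
        super().__init__()
        # CNN encoder over two stacked consecutive monocular frames (6 channels).
        self.cnn = nn.Sequential(
            nn.Conv2d(6, 64, kernel_size=7, stride=2, padding=3), nn.ReLU(),
            nn.Conv2d(64, 128, kernel_size=5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(128, 256, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(256, visual_feat_dim),
        )
        # First RNN: processes the raw IMU samples between two frames.
        self.imu_rnn = nn.LSTM(imu_dim, hidden, batch_first=True)
        # Second RNN: mid-layer (tightly-coupled) fusion of visual and inertial
        # features, rather than combining two separate final pose estimates.
        self.fusion_rnn = nn.LSTM(visual_feat_dim + hidden, hidden, batch_first=True)
        # Regress a 6-DoF relative pose (translation + rotation) per frame interval.
        self.pose_head = nn.Linear(hidden, 6)

    def forward(self, frame_pairs, imu_seq):
        # frame_pairs: (B, T, 6, H, W) stacked consecutive frames
        # imu_seq:     (B, T, N_imu, 6) raw accel/gyro samples per interval
        B, T = frame_pairs.shape[:2]
        visual = self.cnn(frame_pairs.flatten(0, 1)).view(B, T, -1)
        imu_out, _ = self.imu_rnn(imu_seq.flatten(0, 1))
        imu_feat = imu_out[:, -1].view(B, T, -1)        # last hidden state per interval
        fused, _ = self.fusion_rnn(torch.cat([visual, imu_feat], dim=-1))
        return self.pose_head(fused)                    # (B, T, 6) relative poses


if __name__ == "__main__":
    # Dummy usage with illustrative tensor shapes.
    net = VIOFusionNet()
    frames = torch.randn(2, 5, 6, 128, 256)   # batch of 2 sequences, 5 frame pairs
    imu = torch.randn(2, 5, 10, 6)            # 10 IMU samples per frame interval
    print(net(frames, imu).shape)             # torch.Size([2, 5, 6])
```

Training such a network end-to-end on raw IMU samples and image pairs is what allows calibration quantities (biases, scale factors, camera intrinsics/extrinsics) to be absorbed implicitly into the learned weights.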